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Abstract 


The subject of this thesis is the coupling of quantum fields to a classical gravi¬ 
tational background in a semiclassical fashion. It contains a thorough introduction 
into quantum field theory on curved spacetime with a focus on the stress-energy 
tensor and the semiclassical Einstein equation. Basic notions of differential geometry, 
topology, functional and microlocal analysis, causality and general relativity will be 
summarised, and the algebraic approach to quantum field theory on curved spacetime 
will be reviewed. The latter part contains an introduction to the framework of locally 
covariant quantum field theory and relevant quantum states: Hadamard states and, 
on cosmological spacetimes, adiabatic states. Apart from these foundations, the 
original research of the author and his collaborators will be presented: 

Together with Fewster, the author studied the up and down structure of circular 
and linear permutations using their decomposition into so-called atomic permutations. 
The relevance of these results to this thesis is their application in the calculation of 
the moments of quadratic quantum fields in the quest to determine their probability 
distribution. 

In a work with Pinamonti, the author showed the local and global existence 
of solutions to the semiclassical Einstein equation in flat cosmological spacetimes 
coupled to a massive conformally coupled scalar field by solving simultaneously 
for the quantum state and the Hubble function in an integral-functional equation. 
The theorem is proved with the Banach fixed-point theorem using the continuous 
functional differentiability and boundedness of the integral kernel of the integral- 
functional equation. 

Since the semiclassical Einstein equation neglects the quantum nature of the 
stress-energy tensor by ignoring its fluctuations, the author proposed in another work 
with Pinamonti an extension of the semiclassical Einstein equations which couples 
the moments of a stochastic Einstein tensor to the moments of the quantum stress- 
energy tensor. In a toy model of a Newtonianly perturbed exponentially expanding 
spacetime it is shown that the quantum fluctuations of the stress-energy tensor induce 
an almost-scale-invariant power spectrum for the perturbation potential and that 
non-Gaussianties arise naturally. 
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Introduction 


The subject of this thesis is the interplay between quantum matter and gravity, i.e., the 
coupling of quantum fields to a classical gravitational background in a semiclassical 
fashion. Semiclassical gravity describes physics midway between the classical regime 
covered by the Einstein equation and a full-fledged quantum gravity. However, while 
no theory of quantum gravity is universally accepted, quantum field theory on curved 
spacetimes offers an approach to the ‘low’-energy and ‘small’-curvature regime based 
on the firm foundation of quantum field theory and Lorentzian geometry. Despite of 
the employed approximation, it has made several successful and relevant predictions 
like the Fulling-Davies-Unruh effect [67, 105, 214], the Hawking effect [100, 118], 
cosmological particle creation [171] and the generation of curvature fluctuations 
during inflation [108, 120, 161, 202], It is generally expected that a successful theory 
of quantum gravity also describes these phenomena and, in fact, they are used as 
criteria to select candidate theories. 

Although quantum gravity is one motivation for studying quantum field theory 
on curved spacetime, it is not the only reason. While quantum field theory is typically 
formulated on a Minkowski background, the Universe appears well-described by a 
curved spacetime and Minkowski spacetime provides only a local approximation. 
However, even the slightest gravitational interaction causes many of the basic as¬ 
sumptions of ‘standard’ quantum field theory on Minkowski spacetime to fail and in 
important situations, like inflation, the departure from a flat background is not small 
but causes important effects that cannot be neglected. From this point of view it 
would be conceptually very unsatisfying if it was not possible to successfully formulate 
quantum field theory on a curved spacetime in such a way that it reduces to standard 
QFT in the case of a flat background. 

Attempts to formalize quantum field theory in a mathematically exact manner 
have led to many significant insights into the structure of quantum fields: the CPT 
theorem, the spin-statistics connection, and superselection sectors to name a few, 
see e.g. [110, 206]. By studying aspects of semiclassical gravity and quantum field 
theory on curved spacetimes in the rigorous framework of algebraic quantum field 
theory, one hopes to gain deep and novel insights into the subtle nature of quantum 
fields on curved spacetimes and at the same time often prove theorems that have 
also a purely mathematical value. Moreover, as a consequence of the correspondence 
principle, it is highly plausible that a careful investigation of the semiclassical theory 
gives us further hints about the structure of an eventual theory of quantum gravity. 
In particular, one can expect that observations in cosmology are already described to 
high precision within semiclassical Einstein gravity and that tight limits can be placed 
on the creation of extreme objects such as wormholes in generic spacetimes. 

In the formulation of quantum field theory on Minkowski spacetime one usually 
starts the with the unique Poincare-invariant vacuum state as the ground state in 
a Fock space motivated by the particle interpretation. On a generic spacetime, due 
to the absence of any symmetries, no such distinguished state can exist and, as 
illustrated in the Unruh and the Hawking effect, no unique particle interpretation is 
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available. This suggests that the starting point for a quantum field theory on curved 
spacetimes should be a formulation that does not require a preferred state. For 
this reason, rigorous quantum field theory on curved spacetimes is often discussed 
within the algebraic approach to quantum field theory [109-111]. In the algebraic 
approach one begins by considering an abstract algebra of quantum fields, which 
respects conditions of locality (quantum fields only depend on the local structure of 
the spacetime) and causality (causally separated quantum fields (anti)commute). 

A modern formulation of QFT on curved spacetimes is the framework of locally 
covariant quantum field theory [46], It helped the development of several important 
contributions to QFT on curved spacetimes like renormalization and perturbative 
algebraic quantum field theory [43, 101, 102], superselection sectors on curved 
spacetimes [47], abstract and concrete results on gauge theories [30-32, 59, 86, 194] 
and many other results. In this framework one considers quantum field theories as 
covariant functors from a category of background structures to a category of physical 
systems. In most simple examples the background structure is given by a category of 
globally hyperbolic spacetimes with so-called hyperbolic embeddings as morphisms, 
but the background structure can be replaced by anything reasonable that allows 
for a categorical formulation, see [2, 177] for examples of alternative choices. A 
suitable category representing physical systems for algebraic quantum field theory 
is a category of *-algebras so that a quantum field theory maps a spacetime to an 
algebra of observables in that spacetime. 

However, while states, viz., positive linear functionals on a *-algebra, are not 
necessary for the formulation of the theory, they are indispensable if one wants to 
make quantitative predictions. Given a state it is possible to return to a Hilbert 
space picture as a representation of the * -algebra of observables via the Gel’fand- 
Naimark-Segal theorem. Not all possible states on the algebra of quantum fields 
are of equal physical importance. Physically and mathematically preferred states 
are the so-called Hadamard states, which have an ultraviolet behaviour analogous 
to that of the Minkowski vacuum. Hadamard states are for example required for a 
reasonable semiclassical Einstein equation; otherwise the fluctuations of the quantum 
stress-energy tensor are not even distributions and the semiclassical Einstein equation 
becomes physically meaningless, because we equate a quantity with a probabilistic 
interpretation and ‘diverging’ fluctuations with a classical non-fluctuating quantity. 
Major advances in quantum field theory on curved spacetime were achieved after it 
was realized in [181] that all Hadamard states satisfy a constraint on the wavefront 
set of the n-point functions of the state. This constraint was called microlocal 
spectrum condition in allusion to the condition from Wightman quantum field theory 
on Minkowski spacetime. In particular, this discovery led to the formulation of a 
rigorous theory of renormalization and a concept of normal ordering on curved 
spacetimes [44, 45, 123, 124]. 

The developments of quantum field theory on curved spacetimes were often 
driven by problems related to the semiclassical Einstein equation. In the semiclassical 
Einstein equation contains instead of a classical stress-energy tensor the expectation 
value of a quantum stress-energy tensor :T ab : in a certain state co: 
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The quantum stress-energy tensor may be obtained by replacing the products of 
classical fields in the classical stress-energy tensor by normally ordered products of 
quantum fields. This requires the notion of normal ordering on curved spacetimes 
mentioned above. The resulting quantum stress-energy is not uniquely fixed but, due 
to the non-uniqueness of the normal ordering prescription, subject to a renormaliza¬ 
tion freedom, which is a polynomial of local geometric quantities, whose coefficients 
are called renormalization constants. 

Quantum field theory on curved spacetimes is best understood in a few special 
cases of highly symmetric spacetimes. In particular quantum fields on Friedmann- 
Lemaitre-Robertson-Walker spacetimes are well-studied as they are important in 
quantum cosmology. Nevertheless, already in this simplified case many interesting 
effects occur, for example the creation of particles in an expanding spacetime [171]. 
Due to the developments discussed above, in recent years computations of quantum 
field theoretic effects in cosmological spacetimes and their backreaction to the space- 
time via the semiclassical Einstein equation have come into the reach of the algebraic 
approach to QFT on curved spacetimes. 

A first step towards doing cosmology in algebraic QFT on curved spacetimes is 
often the construction of appropriate states. Noteworthy recent works are the holo¬ 
graphic (or bulk-to-boundary) construction [57, 61-65, 1, 157, 159] and the states 
of low energy [165, 210] (see also [70, 71]) which are a Bogoliubov transformation 
of adiabatic states [134, 146, 172]. Given a state, one can study the semiclassical 
Einstein equation to study the backreaction effects of quantum matter fields; this 
has been done, for example, in [56, 66, 112, 114]. Going one step further, one can 
attempt to solve this semiclassical Einstein equation, i.e., finding a spacetime and a 
state on that spacetime so that the equation holds. This problem was analyzed for 
cosmological spacetimes in [83, 178, 3], Other works studied linearized gravity [92, 
113], inflation [81] (see also [4] for a non-standard approach) and other cosmologi¬ 
cal models [221] in the algebraic framework. Furthermore, several researches have 
studied thermal aspects of quantum fields on curved spacetime, [49, 82, 196, 197] to 
name a few, which are arguably of importance to quantum cosmology. 

In this thesis several aspects of the works cited above will be summarized and, 
when necessary, developed further. To give this work a clearer structure, it is divided 
into three parts. 

The first part is mostly intended to lay the foundations of the remaining two parts. 
In Chap. 1 a rapid summary of subjects from differential geometry relevant to QFT 
on curved spacetimes is presented but it also contains a few sections and remarks on 
subjects which are usually not covered in standard text books on differential geometry, 
e.g. bitensors. Chap. 2 focuses on the particular case of Lorentzian geometry including 
notions of causality, the classical Einstein equation and cosmology. Analysis, in the 
broadest sense, will be the subject of Chap. 3 and in that chapter various results on 
topology, *-algebras, functional derivatives and their relation to the Banach fixed- 
point theorem, microlocal analysis and wave equations will be summarised. In favour 
of not jumping back and forth between different subjects in these three sections I 
chose a rather unpedagogical order and the reader should be aware that there are 
many interrelations between the various sections. This should, however, not be a too 
large an obstacle for the reader. Chap. 4 concerns the enumerative combinatorics 
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of permutations and appears somewhat unrelated to most of this thesis. However, 
combinatorics is very important in many applications of quantum field theory and the 
results presented in this chapter are important in the moment problem for quadratic 
quantum fields [90] . The contents of this last chapter represent work by the author 
in collaboration with Fewster and were published in [95]. 

In the second part of this thesis several aspects of quantum field theory on 
curved spacetimes will be discussed. It begins with an introduction to the categorical 
framework of locally covariant quantum field theory with an emphasize on the field 
algebra of Klein-Gordon-like quantum fields in Chap. 5. In Chap. 6 we discuss 
quantum states and in particular the construction of adiabatic states of cosmological 
spacetimes and the holographic construction of Hadamard states on asymptotically 
flat spacetimes. 

The third and last part of this thesis represents the largest portion of novel research 
done during the authors Ph.D. studies; here the semiclassical Einstein equation will 
be analyzed in detail. The basic notions including the stress-energy tensor for the 
scalar field and its renormalization will be introduced in Chap. 7. In Chap. 8 the 
proof of the author and Pinamonti [3] on the local and global existence of solutions 
to the semiclassical Einstein equation will be presented. Finally, in the last chapter of 
this thesis (Chap. 9), the fluctuations of the stress-energy tensor will be analyzed and 
how their backreaction to the metric may be accounted for; this work in collaboration 
with Pinamonti was already presented in [4]. 

This thesis will be concluded with some final remarks on the research presented 
above in the conclusions. Following this the cited references may be found in the 
bibliography, starting with the works (co) authored by the present author. 


Foundations 


Zudem ist es ein Irrtum zu glauben, dafi die Strenge in der Beweisfilhrung 
die Feindin der Einfachheit ware. An zahlreichen Beispielen finden wir im 
Gegenteil bestatigt, daJS die strenge Methode auch zugleich die einfachere 
und leichter fafiliche ist. Das Streben nach Strenge zwingt uns eben zur 
Auffindung einfacherer Schluftweisen; auch bahnt es uns haufig den Weg 
zu Methoden, die entwickelungsfahiger sind als die alten Methoden von 
geringerer Strenge. 

— David Hilbert, “Mathematische Probleme” (1900), p. 257. 




Differential geometry 


Summary 

This chapter is mostly a summary of some common definitions and standard results 
on differential geometry and most of its content can be safely skipped by a reader 
well acquainted with the topic. Proofs are omitted everywhere except in the last 
section and may be found in any text book on differential geometry. Nevertheless, 
the author has attempted to present the material in such a way that many statements 
should become self-evident, although, as always, care should be taken. 

In the first section (Sect. 1.1) the basic theory of differentiable manifolds and 
vector bundles is summarized. Here the notions of coordinates, maps between 
manifolds, vector bundles and sections, the (co)tangent bundle, (co)vectors and 
(co)vector fields, curves, tensor and exterior tensor product bundles, bundle metrics, 
frames, differential operators, and the index notation are explained. The second 
section (Sect. 1.2) is concerned with the definition of connections on vector bundles 
and the objects that follow from this. That is, it discusses the notions of curvature, 
geodesics, and the slightly unrelated concept of Killing vector fields. Differential 
forms and integration are introduced in the third section (Sect. 1.3). In particular we 
will introduce the de Rham cohomology, the Hodge star and the dual of the exterior 
derivative, the codifferential, which leads to the definition of the Laplace-de Rham 
operator, and close with a short discussion of integral manifolds. 

In the presentation of these three sections the author follows partially that of [142] 
and also [6], but these standard definitions may be found in many places in the 
literature. 

The fourth and last section (Sect. 1.4) treats a more obscure topic: bitensors. 
Bitensors are already introduced in an abstract manner in Sect. 1.1.6; a simple, yet 
important, example are biscalars: functions on the product M x M of a manifold M. 
In this last section concrete and important cases of bitensors such as Synge’s world 
function and the van Vleck-Morette determinant are discussed. It will also form the 
foundation for the discussion of the Hadamard coefficients in Sect. 6.1.3. An excellent 
resource on bitensors is the review article [179], which contains most of the first 
part of this section. The second part of this section is concerned with computational 
methods that help the calculation of coincidence limits of bitensors. Here we will 
discuss the semi-recursive Avramidi method developed in [169]. We close this section 
with a recursive method to calculate the coefficients of an asymptotic expansion of 
Synge’s world function in coordinate separation. To the authors knowledge, this 
simple and efficient method has never been fully developed but traces of it may be 
found in [168]. 
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Figure 1.1 . Two overlapping charts and their transition map. 

1.1 Differentiable manifolds and vector bundles 

A topological manifold of dimension n is a second-countable Hausdorff space M that 
is locally homeomorphic to M" (i.e., each point of M has a neighbourhood that is 
homeomorphic to an open subset of R n ). We often omit the dimension of the manifold 
and simply say: M is a topological manifold. 

Since a topological manifold M is locally homeomorphic to R n , we can assign 
coordinates to points of M in each open neighbourhood U c M: A (coordinate) chart 
of M is a pair (U,ip) which gives exactly such a homeomorphism 

if : U -» </?((/) C 

we call U its coordinate neighbourhood. The component functions (x 1 ,..., x n ) = 
are called (local) coordinates on U. 

An atlas A of M is a family of charts ([/;, ip ; ) ie pj which cover M. If any two over¬ 
lapping charts ( U, <p), ( V , xp) in an atlas are smoothly compatible, viz., the transition 
map 

xpoy - 1 : y(un v) -> xp(un v) 

is a smooth, bijective map with a smooth inverse (Fig. 1.1), we say that the atlas is 
smooth. We further say that a smooth atlas A on M is maximal if it is not properly 
contained in any larger smooth atlas so that any chart which is smoothly compatible 
with the charts of A is already contained in A. 

Finally, a smooth manifold is a pair (M,A), where M is a topological manifold 
and A a maximal smooth atlas. A maximal smooth atlas might not exist and, if it exists, 
it is not necessarily unique as shown, e.g., by the existence of exotic R 4 . Nevertheless, 
usually a canonical smooth atlas is understood from context. Then we omit the 
explicit mention of the maximal smooth atlas A and say: M is a smooth manifold. 
One can replace the requirement of the transition maps in an atlas to be smooth 
by requiring that the transition maps are C k , (real-)analytic or complex-analytic (if 
dimM = 2 n, we have IR 2n ~ C n ) thus arriving at the notions of C k , (real-)analytic 
and complex-analytic manifolds. 
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Figure 1.2. A map between two manifolds. 

1.1.1 Smooth maps 

A real-valued function / : M —* K on a smooth manifold M is a C k , smooth or analytic 
function if there exists a chart (17, cp) containing x at every x e M such that the 
composition / o ip” 1 is C fc , smooth or analytic on the image (/?([/); the spaces of these 
function are denoted C k (M), C°°(M) and C"(M) respectively. 

More generally, a map / : M —* N between two smooth manifolds M and N is C k , 
smooth or analytic, if there exist charts ([/, i/j) at x 6 M and (V, ip) at /(x) e N such 
that the composition 

t/j of o (p -1 : (p([/) —»i/>(V) 

is C k , smooth or analytic (Fig. 1.2). If M and N have equal dimension and / is a 
homeomorphism such that / and its inverse f~ l are smooth, we call / : M —* N 
a dijfeomorphism. Whenever there exists such a diffeomorphism between, they are 
dijfeomorphic; in symbols M ~ N. 

1.1.2 Vector bundles 

A (smooth) K-vector 1 bundle of dimension n 

n : E —» M 

consists of two smooth manifolds E, the total space, and M, the base (space), and 
a smooth surjection n, the bundle projection, that associates to every x e M a n- 
dimensional K-vector space E x = 7t _1 ({x}), the fibre of E at x (Fig. 1.3). Moreover, 
we require that around every x there exists an open neighbourhood U c M and a 
diffeomorphism p : (U) —> U x E x such that its projection to the first factor gives 

the bundle projection: pr x o ip = n; p is called local trivialization of the vector bundle. 
A trivialization of a vector bundle over its whole base is called a global trivialization. 

A smooth map / : E —* F between two K-vector bundles n M : E —> M and n N : 
F —> N is a (vector) bundle homomorphism if there exists a smooth map g : M —* N 


1 K will always be either R or C. In particular, IK is a field of characteristic 0. 
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such that n N of = gon M and the restriction of the map to each fibre / ^ : E x —> F g( - X ) 
is K-linear. In other words we require that the diagram 

/ 

E -> 

n M 

M -> 

8 

commutes and that / is K-linear map on each fibre. By definition, if / is a bundle 
homomorphism, then g is given by g = n N o / o 7r~ 1 . 

Given an open subset U c M, we can restrict n : E —* M to a vector bundle n v : 
E v — * U by setting E v = 7t _1 ([/) and n v = Ttfy. More generally, a subset £ / c E such 
that n\ E i : E' —» M is a vector bundle and E 7 n 7t _1 (x) is a vector subspace in 7t _1 (x) 
for all x e M is called a vector subbundle of E. If each fibre of E' has dimension k, we 
say that E' is a rank-fc subbundle of E. For a subbundle n' : E' —* M of E we define 
the quotient bundle E/E' as the disjoint union |_| xeM E x /E' x of the quotient spaces of 
the fibres. 

If M,N are smooth manifolds with a smooth map ip : M —* N and a vector- 
bundle E —* N, we can define on M the pullback bundle xp*E as the bundle whose 
fibres over M are given by (t/)*E) x = E^^ for each x e M. 

A section of a vector bundle E is a continuous map / : M —» E such that nof = id M 
(Fig. 1.3); the space of sections of a vector bundle E is denoted by r(£). We denote 
by r n (£) the space of C n sections (C n maps / : M —* E) of the vector bundle E, while 
the spaces of compactly supported sections are indicated by a subscript 0, e.g., r“(E). 
Furthermore, a local section over an open subset FcMisa section of the vector 
bundle E v . 

If N is another smooth manifold with a vector-bundle F —* N and there exists a 
smooth map ip : M —* N, the pullback section xp*f S ip*F of / e F(F) is defined as 
the section t p*f = f o ip. The opposite of the pullback is achieved by the pushforward 
if ip is a diffeomorphism. Namely, we will say that 'ip ji = h o ip ~ 1 is the pushforward 




Figure 1.3. A vector bundle and a section. 
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Figure 1.4. The tangent space of a manifold at a point and a tangent vector. 

section of h e r(£). If h is compactly supported in a region U c M, it suffices that xp 
is a diffeomorphism onto the image of U; outside of t p(U) we set xpjt identical to 
zero. 

1.1.3 Tangent bundle 

Let M be a smooth manifold. A (tangent) vector at x e M is a linear map v : C°°(M) —> 
M that satisfies the Leibniz ‘product’ rule 

v(f g)=f(x)vg + g(x)vf 

for all /,je C°°(M). The set of all tangent vectors constitutes a vector space T X M 
called the tangent space to M at x; it has the same dimension as the base manifold M 
for each x e M. Note that, if a smooth manifold M is also a vector space, then we 
can identify it with its tangent space, be., M ~ T X M at each point x, which justifies 
to the geometric visualization of the tangent space (Fig. 1.4). 

The vector bundle TiW->M with fibres T X M at x is the tangent bundle of M. In 
the special situation where the n-dimensional smooth manifold M can be covered by 
a single chart, TM is diffeomorphic to M xl". 

The sections r(TM) of the vector bundle are called vector fields. Applying a vector 
field v e r(TM) to a function / e C°°(M), we obtain a new function (v/)(x) > v x f , 
viz. , a vector field defines a linear automorphism on the smooth functions called a 
derivation. This is exactly the Lie derivative S £ v f of a function / e C°°(M) along a 
vector field v: (!T v /)(v) = (v/)(x). The Lie derivative S£ v w of a differentiable vector 
field w with respect to another differentiable vector field v is another vector field 
such that 

2? v w = [v, w] = v o w — w o v 


when applied to smooth functions. It satisfies the Leibniz rule and the Jacobi identity 
w) = (2 v /)w + /se v w, S£ u [v, w] = [S£ u v, w] + [v, S e u w] 

for all / e C°°(M) and vector fields iz, v, w € r°°(TM). 

Given two smooth manifolds M,N and a smooth map F : M — »iV, we can define 
at each point x e M the tangent map or differential of F at x as the linear map 

T X F : T X M —> T F ( x} N, 
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Figure 1.5. The tangent map at a point, 
see also Fig. 1.5, which is for every v e T X M and f e C°°(M) the derivation 

VW(/)=v(/oF). 

The tangent maps off at all points taken together form TF : TM —» TN, the (global) 
tangent map or differential of F. If N = K, i.e., F is a smooth function on a manifold, 
we note that dF = TF is the usual differential. 

If F is even a diffeomorphism, then TF defines a bijection between the vector 
fields on M and N. In this case one can define the pushforward F*v of a vector field v 
on M by F as the vector field on N given at each x e N by 

(F*v) x Tp-i^F . 

Clearly, this is generally not well-defined if F is not a diffeomorphism. Using that F is 
invertible, a pullback F*w of a vector field w on N can be defined as the inverse of 
the pushforward, namely, 

F*w = (F _1 )*w. 

The tangent map allows us to single out an important type of maps between 
manifolds: Immersions are maps F : M —* N such that TF is injective; if in addition 
F is injective and a homeomorphisms onto its image, then it is called an embedding. 
Consequently we say that a subset S c M is an immersed submanifold if it is a 
topological manifold and the inclusion S ■—* M is an immersion; if the topology of S 
is the subspace topology and the inclusion is an embedding, then S is called an 
embedded submanifold. Observe that the tangent space T X S of a submanifold ScM 
is a subspace of T X M at every point x e S. We can then say that a vector field v is 
tangent to S if v x e T X S at every point x e S. 


1.1.4 Curves 

A parametrized curve on a smooth manifold M is a map y : I —* M from a connected, 
usually open, interval I czR into the manifold. 

A curve y : [a,b) —» M is called inextendible if there exists a sequence t n converg¬ 
ing to b such that y(t n ) does not converge. This notion readily extends to left-open 
domains and open domains. 

The velocity f(t ) at t of a differentiable parametrized curve y : I —* M, where 
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I c K is an interval, is the vector 


HO = T t r 



e T m M. 


Working in the opposite direction, we can try to find a curve y, whose velocity at 
every point is determined by a given vector field v: 


fCt) = v r (0- (!•!) 

Such a curve y is called an integral curve of v. 

For a sufficiently small interval I around 0 a unique integral curve starting at 
a point x e M can always be found by solving the differential equation (1.1) in a 
coordinate neighbourhood of x. The domain of an integral curve cannot necessarily 
be extended to the entire real line. We say that a vector field is complete if the domain 
of all of its maximal integral curves, i.e., the integral curves whose domain cannot be 
extended, is the entire real line. 

The flow xp t of a complete vector field v is 


t/hU) = Yx(0, 


where y x is the maximal integral curve starting at x e M. This defines for every t 
a diffeomorphism xp t :M —* M and the collection of all these diffeomorphisms is a 
group {xp t }te r with unit t p 0 , multiplication xp s °t/) t = t p s+t and inverse xpf 1 = xp_ t . 
If v is not complete, then one can still define a local flow xp t around a point x with 
domain U, where U is a neighbourhood of x, and t e I is restricted to an interval 
around 0. 


1.1.5 Cotangent bundle 

At each point x of a smooth manifold M, we define the cotangent space to M at x, 
denoted T*M, as the dual space of tangent space at the same point, namely, 

T*M = (T X M)*. 


The elements of the cotangent space T*M are called (tangent) covectors; naturally 
they are linear functionals on the tangent space. Taking the cotangent spaces at 
each point as fibres, we obtain T*M, the cotangent bundle of M. The sections of 
the cotangent bundle are called covector fields or one-forms (see Sect. 1.3). The 
differentials (at a point) of functions discussed above are examples of covectors resp. 
covector fields. 

A smooth map F : M —» N between smooth manifolds M,N induces at each point 
x e M the map T X F : T X M —* Tp^N between the tangent spaces. By duality one 
can find the transpose map T*F : T*^N —» T*M, called the cotangent map of F at x, 
between the cotangent spaces at F(x) and x, which is given for each v e T X M and 

co e T* ,N as 

F O) 


(T*F(m))(v) = m(r x F(v)). 
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This map then gives the (global) cotangent map T*F : T*N —* T*M and thus facilitates 
the definition of the pullback F*co of a covector field co on N by F as the covector 
field on M given at each x e M by 


(F*co) x = T*F(co fm ). 


Note that, different than the pushforward, the pullback is even defined if F is not a 
diffeomorphism. However, if F is a diffeomorphism, we can define the pushforward 
of a covector field as the pullback via the inverse F _1 . That is, 

F*rj = (F -1 )*p, 

where p is a covector field on M. The pushforward of a compactly supported covector 
field can also be defined if F is an embedding and dim M = dimIV by setting the F^r) 
to zero outside the image of F. 

1.1.6 Tensors product bundles 

Given two K-vector bundles n : E —* M and p : F —* M over the same smooth 
manifold M, we can define the tensor product bundle n®p:E®F^>M, which is 
just the fibrewise tensor product of vector spaces. Namely, the fibre of E ®F at x e M 
is (F ® F) x = (n ® p) _1 ({x}) = E X ® F x . 

In particular, we denote by T q (E) the tensor bundle of type ( p,q ) of a vector 
bundle E: 

TP(E) = E® p ® (F*)® 9 ; 

the tensor bundle of the (co) tangent bundle is simply denoted by TqM instead 
of Tq(TM). The sections of the tensor product bundle T^M are called tensor (fields), 
if p = 0, we say that a tensor is covariant, while we say that it is contravariant if 
q = 0. 

The Lie derivative defined in Sect. 1.1.3 generalizes to covariant tensor fields by 
duality in the following way: Given a covariant tensor field S e F 0 °('F| ) M) and vector 
fields v, w 1; ..., w q e r°°(TM), the 2? V S of S along v can be defined by 

(Se v S)(w l3 ..., w q ) = v (S(w l5 ..., Wq)) - S(S£ v w 1; w 2 ,..., w q ) 

-S(w 1 ,...,Wq_ 1 ,Se v Wq). 

Note that it satisfies the Leibniz rule 

2 V (S ®T) = (2 V S) ®T+S®'E£ V T, 

where T is any other differentiable covariant tensor field. Since the Lie derivative on 
covariant tensor fields defines a Lie derivative on covector fields, this may be used to 
define a Lie derivative on mixed tensor field. 

The pullback or pushforward of a mixed tensor field S e r(T q N) or T e r(T q M ) 
by a diffeomorphism F : M —* N, is defined as the (p + q)-fold tensor product of the 
pullback or pushforward map for (co)vector fields. If S is covariant (i.e., p = 0), then 
the pullback is also defined if F is not a diffeomorphism. 



1.1. Differentiable manifolds and vector bundles 


15 


If F = £, we can define the symmetric and antisymmetric tensor product bundle 
£ © £ and £ A £ as the quotient bundles under the fibrewise equivalence relations 
v © w ~ ±v (8) w for all v, w e E x . More generally, we denote by S p (£) and f\ P (E) the 
p-th (anti)symmetric tensor product bundle which satisfy fibrewise the relation 


v(x 1; ...,x p ) = v(x CT(1) ,..., x a(p) ) or 
v(x 1; ...,x p ) = (sgn a)v(x aW ,..., x a(p) ) 


for v e Ex and all cr e © the symmetric group ofp elements, cf. Sect. 4.1.1. That 
is, the fibres of S p (£) and /\ p (£) are the p-th symmetric (resp. exterior) power of 
the fibres of £. Maps Sym : Tq(£) —* S p (£) and Alt : Tq(£) —* /\ P (E) extend from 
the fibrewise maps 


Sym(v(x 1 ,...,x p )) 

Alt(v(x 1 ,...,x p )) 


1 

p! 

1 

p! 


^ i ^( x cr(l)> • • • j x cr(p)) 

cre6 p 


and 


X! ( S § n cr ) v ( x o-(l), • • •, x a(p)), 

cre6 p 


where v e £® p . Moreover, fibrewise products © : S p (£) x x S 9 (£) x —» S p+q (£) x and 
A : /\ p (£) x x /\ q (£) x —>> /\ P+q (£) x are defined as 


v©w = Sym(v®w) and 


v A w = 


(p +q)\ 

-Alt(v © w). 

p\q\ 


Another possibility to combine two vector bundles £ and £ is the exterior tensor 
product £KI£—>MxM. It is defined as the vector bundle over M x M with fibre 


(£KI£) X = 7i 1 ({x})©p I ({x / }) = £ X ©£ X / 


over the point (x.x')- The sections of the exterior tensor product T ? P MKI T s r M are 
called bitensor (fields). 


1.1.7 Metrics 

Every vector bundle £ has a dual bundle n* : E* —> M which has as its fibre £*, the 
dual vector spaces of the fibres £ x . We call the natural pairing /(v) of an element 
/ e £* in the dual fibre on an element v e £ x in the corresponding fibre a contraction. 

A canonical isomorphism between £ and £* can be constructed if £ carries a 
(bundle) metric, i.e., a map 

(-,•):£ x M £ —> K 

such that the restriction (-,•)* to each fibre £ x is a fibrewise non-degenerate bilinear 
form; a (positive-definite) bundle metric can always be constructed. In other words, 
a metric on £ is a section in the tensor product bundle £(£* © £*). Thus a metric 
induces a metric contraction between two elements of the same fibre £ x . 

The dual metric (•, •)* is the unique metric on E* such that 

(cu,tj)* = (v,uO with to = (v, •), V = 0,0 



16 Chapter 1. Differential geometry 


for all v, w e r(£). Moreover, given metrics (•,-) E and (•, •) F on E and F, they induce 
metrics on the tensor product bundle E ® F and the exterior tensor product bundle 
EMF. For the tensor product bundle it is defined fibrewise for all v e E x , w G F x by 

(v ® W, V ® w)^® F = (v, v) x (w, w) F x 

and can be extended to arbitrary pairings by polarization and linearity; an analogous 
construction works for the exterior tensor product. 

In pseudo-Riemannian geometry we find the tangent bundle equipped with a 
continuous symmetric metric usually denoted 

g : TM x M TM->K 

with dual metric g* on the cotangent bundle. The canonical isomorphism induced 
by g between TM and T*M is given by the musical isomorphisms ‘flat’ \> : TM —» T*M 
and its inverse ‘sharp’ jj : T*M —* TM: 

v b = g(v, •), cJ = g(to, •)• 

A tuple (M, g) of a smooth manifold M with a metric g on its tangent bundle is called 
a pseudo-Riemannian manifold. 

The maximal dimension of subspaces of T X M where g x is negative-definite is 
called the index Ind(g) of g; since g is continuous and non-degenerate, the index 
constant over the manifold. We distinguish in particular two cases: 

(a) If Ind(g) = 0 or, in other words, g is pointwise positive-definite, we say that g 
is a Riemannian metric. 

(b) If Ind(g) = 1 (and the manifold at least two-dimensional), we say that g is a 
Lorentzian metric. 2 

We say that (M, g) is a Riemannian ( Lorentzian ) manifold if the metric g is Riemannian 
(Lorentzian). 

The two prototypical examples for a Riemannian and a Lorentzian manifold 
are Euclidean space and Minkowski space(time): n-dimensional Euclidean space is 
the smooth manifold over K n with a the global chart (K n , id), coordinate functions 
(x 1 ,..., x n ) and with the Euclidean metric 

n 

5 = ^dx ! <g> dx l . 

i=i 

In the conventions chosen here, (l-l-n)-dimensional Minkowski spacetime is a smooth 
manifold modelled on R 1+n with the single chart (IR 1+ri , id), coordinate functions 
(t, x 1 ,..., x n ) and with the Minkowski metric 

n 

ri = — dt (g)dt +^^dx‘ <g>dx ! (1.2) 

! = 1 

2 This choice corresponds to the —I—H- convention, which we will use. In particle physics one often 
adopts the opposite convention H-, i.e., ( M,g ) is Lorentzian if Ind(g) = n — 1. 
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in the coordinate frame. Occasionally one uses the shorthand M to denote four¬ 
dimensional Minkowski space (R 4 , rj). 

Let (M,g M ) and ( N,g N ) be pseudo-Riemannian manifolds and ip : M —» N a 
diffeomorphism. The map xp is called a conformal isometry if 


xp*g N = 17 2 g M 


for some positive function El e C?°(M). Conformal isometries preserve the angles 
between vectors; in particular g N (v, | 0 implies g M (.xp*v,xp*w)^x) | 0 for 

all v, w e r(TlV) and x € M. In the special case that T? = 1, we say that t p is 
an isometry. These notions generalize straightforwardly to immersions and em¬ 
beddings; and we call these maps conformal immersions /embeddings and isometric 
immersions/embeddings. 

1.1.8 Frames 

A local frame of a vector bundle E on M is a set {e^} of smooth local sections r 0O (£ (7 ) 
on a domain U c M contained in a neighbourhood of a local trivialization such 
that {e^fx)} forms a basis for each fibre E x over U. The dual frame is the set {e 
in r°°(£*) that satisfies e M (e v ) = 5 X , i.e., the e^ are the dual basis to e v . Naturally, 
given frames {e^/x)} and {/ v (x)} of vector bundles E,F on M, they induce frames 
{e jU (x) <8>/ v (x)} on the tensor product bundle E <tS)F; the generalization to the exterior 
tensor product bundle EMF is immediate. 

If E is equipped with a metric (■, ■), we say that the frame {e,,} is orthogonal 
(K = R) or unitary (K = C) if it forms an orthonormal basis for each fibre over U, 
i.e., if 

( e IJL ,e v ) = 5 lxv £ lx with = (e M , e^) = ±1. 

Frames allow us to perform calculations in component form, viz., given a sec¬ 
tion s e r(£) its components in the frame {e^} on U are given via the dual frame {e^} 
as s^ = s(e^) such that s = s^e^. This is the first instance where we used the summa¬ 
tion convention: Unless otherwise noted, summation over balanced indices (one upper 
and one lower) is always implied. 

If s is the section of a (exterior) tensor product bundle and frames on the single 
bundles are given, we use multiple indices to denote the sections. For example, if 
g e r(T*M ® T*M ), we can write 

g = Stxv ® dx v 
in terms of the coordinate covectors. 

The atlas of a manifold induces natural local frames on the tangent and the 
cotangent bundle. If (U, cp) is a smooth chart on M in a neighbourhood of x, then 
the coordinate vectors 3 

= (T^r 1 


3 Often we will use the shorthand d jt for d/d x M . 
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define a basis on T X M because T x ip : T X M —> R n ~ is an isomorphism. 
Together with the coordinate functions these coordinate vectors at every point induce 
natural coordinates on (TM) V . A local frame for (T'M) L , is then simply the dual 
frame {dx^}. 

Note that the coordinate vector fields { dp } associated to a coordinate chart {x ,u } 
form a very special local frame of the tangent bundle. Whereas the commuta¬ 
tor [dp, d v ] vanishes, this is no longer true in every frame {e,,}, where 

[ e /x> e v ] — d\ lv e p 

has in general non-vanishing commutation coefficients c f \ tv . 

1.1.9 Differential operators 

Given a vector bundle E —» M of n-dimensional smooth manifold M, a linear differen¬ 
tial operator of order m (with smooth coefficients) is a linear map P : r°°(£) —* r°°(£) 
which, in local coordinate {x ,,J } on U, is given by 

p tt;= X! 

ja|<m 

where a = (a 1; ..., a n ) are multi-indices with d a = d 1 1 • • • d n " and the coefficients 
a a : r°°(£) —* r°°(£) are linear maps. That is, P is locally defined as a polynomial 
in the partial derivatives {dp}. 

The polynomial 

pO,£) = X! 

|a|<m 

where f a = and ^ is a covector field with components ^ = ^dx^, is called 

the total symbol of P. The leading term of p(x, £), 

o- P (x,?)= a a (x)?“, 

|a|=m 

is the principal part or principal symbol of P. While this is not true for the total symbol, 
one can check that the principal symbol is covariantly defined as a function on the 
cotangent bundle: cr P : r°°(S m (T*M) ® E) —» r°°(E). 

Suppose that (M, g) is a pseudo-Riemannian manifold. If the principal symbol cr P 
of a differential operator P is given by the metric 

o' p(x,£) = -g x (£, ?)id £jt , 

we say that P is normally hyperbolic or, alternatively, that it is a wave operator. 
Normally hyperbolic operators on globally hyperbolic spacetime have a well-posed 
Cauchy problem and therefore play an important role in quantum field theory on 
curved spacetime. 

Given a second differential operator Q on the same vector bundle E, the composi¬ 
tion P o Q is also a differential operator and the principal symbol of the composed 

4 Although we will not explicitly state this, sometimes we will use differential operators with 
non-smooth coefficients. In that case the coefficients map into C k sections. 
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operator is given by the composition of the principal symbols: <t Po q = cr P o ctq. We 
say that P is pre-normally hyperbolic if there exists Q such that P o Q is normally hyper¬ 
bolic. One can show that also Q o P is normally hyperbolic and thus Q is pre-normally 
hyperbolic too [160], 

1.1.10 Index notation 

We just saw that in a frame {e^} of a vector bundle E —» M we can calculate the 
components of sections with respect to that frame. For example, given two vectors 
fields v, w e r(TM ) on a pseudo-Riemannian manifold (M,g), we can write their 
metric contraction in terms of their components with respect to the coordinate 
(co)vector fields {d^} and {dx M } in a coordinate neighbourhood: 

g(v,w) = g fJLV v fl w v . 

Repeated indices imply summation by the Einstein summation convention as usual. 

Usually the frame is not explicitly mentioned but instead implicitly given by a 
selection of letters for the indices. Henceforth the small Greek letters p, v, A, p, cr will 
always be indices for a coordinate frame of the (co) tangent bundle in the concrete 
index notation. 

When calculating contractions between more complicated tensors the notation in 
terms of indices is often over the abstract index-free notation which quickly becomes 
unwieldy. Moreover, if the horizontal position of indices is kept fixed, we can use a 
metric to lower and raise indices, e.g., returning to our example, we write 

g(v,w) = V^VlA 

That is, we identify the components of the vector field v with the components of the 
associated covector field g(v, Contracting a tensor S e r(Tj 2 M) with the vector 
fields v, w, we see the advantage of this notation 

S (g(v, •), g(w, •)) = grpgxo' S^v'V'w* = S^v'V w A 

The ‘natural’ position of the indices of the tensor S must, however, be agreed upon 
beforehand. 

Note that the formal aspects of this notation do not necessitate the existence of 
a frame. This leads to Penrose’s abstract index notation. Even in the absence of a 
concrete frame, we write for example 

S (g(v, •), g(w, ■)) = gbdgceS a de V a V b W C = S abc V a V b W C 

Now, an index only labels a slot in the index-free expression and does not carry any 
numerical value. In particular, Einstein summation convention does not apply to 
abstract indices - it would not even make sense - and double indices only imply 
(metric) contractions. We will often use the small Latin letters a, b, c, d, e as abstract 
indices for the (co) tangent bundle. 

Both for abstract and concrete index notation it is useful to introduce some 
shorthands. Symmetrization and antisymmetrization of tensors are denoted by 
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parentheses and brackets: 

S (ab) = |( S ab + Sba S [ab] = |( S ab _ S ba) 

with the obvious generalization to higher-order tensors. Partial and covariant deriva¬ 
tives (see below) are sometimes indicated by comma and semicolon: 

O''),a = 3a(” 0, (•••) ;a = V a (--0. 


1.2 Connections and curvature 

A (Koszul) connection V £ on a m-dimensional K-vector bundle E —* M is a K-linear 
map V £ : r°°(£) —» r°°(£ 0 T*M) that satisfies the Leibniz rule 

v £ (ip/) = v £ / +/ 0 d(p 

for all (f e C°°(M) and / e r°°(£). Every vector bundle admits a connection. 
Henceforth we will often drop the superscript indicating the vector bundle that the 
connection acts on and simply denote it by V. Given a vector field v, the connection 
V defines the covariant derivative along v as V v : T 1 (E) —> T(E) with 

V v • = (V -)(v). 

If, in addition, the vector bundle E is equipped with a C 00 bundle metric (•, •), we say 
that V is a metric connection if 

v(f ,h) = (V v /,h) + (/, V v h) 

holds for all /, h e r 1 (E). A connection V on E = TM is torsion-free if the Lie bracket 
of two vector fields v, w is given by [v, w] = V v w — V w v. 

Let E, F be two vector bundles with connections V £ , V £ and sections / e r 1 (E), 
h e r 1 (F) and u e r 1 (E*). A connection on the the tensor product bundle E 0 F is 
defined by 

Vf £ (/ 0 h) = (V £ /) 0H/0 (V £ h). 

Moreover, a dual connection is obtained from 

(vfu)(/) = v(u(/))-u(V £ /). 

This can be used to extend a connection V on a vector bundle E to its dual bundle E* 
and more generally to the tensor bundle T^E. 

If if : M —> N is a diffeomorphism between two manifolds and E —* N a vector 
bundle with a connection V £ , then we automatically find a unique connection xp*~V 
on the pullback bundle \p*E. The pullback connection is given by 

W*v) v (V7 ) = V>*(v5^ v /) 


for all / e r 1 ^) and v e T(TM). 
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1.2.1 Levi-Civita connection 


The Levi-Civita connection is the unique metric connection V on TM with C 00 metric g 
that is torsion free. Therefore it satisfies the Koszul formula 

2g(V u v, w) = ug(v, w) + vg(u, w) - wg(u, v) g 

- g(u 3 [v, w]) - g(v, [u, w]) - g(w, [u, v]). 

Usually we will call the covariant derivative associated to the Levi-Civita connection 
of (M, g) simply the covariant derivative. 

In a chart (U,<p) of M, we have 




r p d 

pv U P 


and we call r p JV the Christojfel symbols 5 of the Levi-Civita connection V for the 
chart ( U,ip ). The Christoffel symbols are symmetric in their lower indices r p flv = 
r p V|Lt because [d v , d v ] = 0. Given two vector fields v, w e r 1 (TM), written in the 
coordinate basis as v = v^d^ and w = w has covariant derivative 


V v w = v p 


dw p 

-1- r p w 1 

8x p pv 


along v. The Kozul formula (1.3) yields a formula for the Christoffel symbols 

p _ 1 pX f dgnX dg vX 
pv 2 dx p dx x ) 


1.2.2 Curvature of a connection 

Different from ordinary second derivatives, covariant second derivatives 

(V 2 /)(v,w) = V v V w /-V VvW /, 

where v, w e r 1 (TM) and / e T 2 (£), do not commute in general. The curvature 
F e r{T*M ® T*M ® E <g> £*) of a vector bundle E with connection V quantifies this 
failure of the covariant second derivative to commute and we define it as 

f(v,w) = V v V w - V w V v - V [VjW] . 


We say that the connection is flat its curvature F vanishes. If E = TM and V is 
the Levi-Civita connection of a pseudo-Riemannian manifold (M, g) with smooth 
metric g, we denote the curvature by R(v, w) instead and call it Riemann curvature 
(tensor) of (M, g). 

Let u,v,w e r 2 (TM) be vector fields and / e r 2 (E) a section of E. By defi¬ 
nition the curvature is skew-hermitian: F(v,w) = —F(w,v). If the connection V 
is compatible with a metric (•, ■) on E, then we also have that F is skew-adjoint, 
(F(v, w)f,f) = — (/,F(v, w)f), and that it satisfies the first Bianchi identity 

(V u F)(v, w) + (V v F)(w, u) + (V w F)(u, v) = 0. 


5 Note that V is not a tensor and thus the Christoffel symbols do not transform as tensors. 
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Figure 1.6. The parallel transport of a vector along a closed path. 

Furthermore, if E = TM and V the Levi-Civita connection, also the second Bianchi 
identity 

R(u, v)w + R(v, w)u + R(w, iz)v = 0 


holds. 

Several other curvature tensors can be derived from the Riemann curvature. 
The Ricci curvature (tensor) is defined as the symmetric (0,2) tensor Ric(v, w) = 
tr(iz —* R(v, u)(w)), where u,v,w e r 2 (TM); if it vanishes, we say that (M, g) is 
Ricci-flat. Furthermore, we obtain the Ricci scalar as the contraction of the Ricci 
tensor Seal = tr g Ric = tr Ric^. Using the coordinate (co)vectors, we can express the 
Riemann tensor in component form as: 

R a .= 2 ^.- 3 /" +r a , r p -r a r p ,. 

pvA v pX X pv p X pv pv pX 

The Ricci tensor and Ricci scalar are then written as R flv = — 8^R a ^ and R = 
g^'Rpv 


1.2.3 Geodesics 


Let 7 : / -> M be a smooth curve parametrized by an interval /CM and / e r 1 (y*£) 
a section in the pullback of the vector bundle E —* M. The covariant derivative of f 
along j is given by 


yf . 

dt 


(r*v K0 )/, 


i.e., it is a special case of a pullback connection. If the covariant derivative of / 
vanishes along y, we say that / is parallel to y. Therefore connections facilitate the 
notion of parallel transport along curves (Fig. 1.6, see also Sect. 1.4.2). Namely, given 
a s 0 e £ r (t 0 ) at the point y(t 0 ), the parallel transport of f 0 along y is the unique 
solution / of the ordinary differential equation V(y*/)/dt = 0 with initial condition 
(/°rXt 0 ) = /o- 

Auto-parallel curves, i.e., curves that satisfy 

^r*f(t) = o, 


are called geodesics of the connection V. These are the usual geodesics (local min- 
imizers of arc length if the metric is Riemannian) with respect to a metric g of a 
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pseudo-Riemannian manifold (M, g) if V is the Levi-Civita connection with respect 
to g. 

It follows from the theory of ordinary differential equations that, given a point x e 
M and any vector v e T X M, there exists a unique geodesic y v such that y v (0) = x and 
y v (0) = v. Let T x be the set of vectors v at x that give an inextendible geodesic y v 
defined (at least) on the interval [0,1]. We say that M is geodesically complete if 
T x = T X M at every x e M. Nevertheless, if there is a vector v ^ T x , there exists eel 
such that ev e T r 

The exponential map at x is defined as the map (Fig. 1.7) 

ex Px -r x ^M, v -» y v ( 1). 

That is, remembering that geodesics can be linearly reparametrized, the exponential 
map exp x maps vectors at x to geodesics through x. For each x e M there exists 
an open neighbourhood U' c T X M of the origin on which exp x is a diffeomorphism 
into an open neighbourhood U c M of x. If U' is starshaped 6 , then we say that U is 
geodesically starshaped (U is a normal neighbourhood ) with respect to x. Moreover, 
if U c M is geodesically starshaped with respect to all of its points, it is called 
geodesically convex. 


1.2.4 Killing vector fields 

Given an n-dimensional pseudo-Riemannian manifold (M, g), a Killing vector field is 
a vector field k such that 


s e K g = o. 

In terms of the Levi-Civita connection on (M, g), this equation may also be written as 
V a K b — V b K a = 0 in the abstract index notation. More generally, a conformal Killing 
vector field is a vector field k such that 


Sf K .g = tog with to = - tr(V/c). 

n 


6 A starshaped neighbourhood S of a vector space V is an open neighbourhood S c V of the origin 
such that tv e S for all t e [0,1] and v e S. 
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An equivalent definition of the Lie derivative of a covariant tensor S e r 1 (T (? 0 M) 
along a differentiable vector field k is given by 

^K-S = lim^(t/)*S-S) 

where ip t is the (local) flow of k. Therefore, a vector field k is (conformal) Killing 
vector field if and only if the flow that it generates is a family of local (conformal) 
isometries. In other words, k encodes a (conformal) symmetry of (M, g). 

Now, if y is a geodesic and k a Killing vector field, then it holds that g(/c, y) is 
constant along y. That is, the geodesics of (M, g) correspond to conserved quantities 
under the symmetry given by k. If k is a conformal Killing vector field, then g(fc, y) is 
constant only if g(f, f) = 0. 


1.3 Differential forms and integration 

As already noted above, sections of the cotangent bundle T*M are called (differential) 
1-forms. More generally, sections of the p-th antisymmetric tensor bundle f\ p (T*M ) 
are called (differential) p-forms. The set of all smooth p-forms on M is usually denoted 
Q p (M) = r°°(/\ p T*M ). 


1.3.1 Exterior derivative 

The exterior derivative d : fi p (M) —» 17 P+1 (M) is the unique generalization of the 
differential of functions such that: 

(a) df for 0-forms (i.e., functions) / e 17°(M) is the usual differential, 

(b) d is a A-antiderivation, i.e., it satisfies the product rule 

d(o) A p) = do> A rj + (— l) p to A dp, 


where to e f2 p (M ) and p e 

(c) d 2 = d o d = 0, 

(d) d commutes with pullbacks. 


We say that a form co is closed if doj = 0 and exact if to = dp for some form p. While 
a closed form is in general not exact, the opposite is obviously always true. The 
extend to which closed forms fail to be exact is measured by the de Rham cohomology 
groups H p r (M ) of the smooth manifold M 




{to e X7 P (M) | to closed} 
e fl p (M) | to exact} 


Replacing p-forms with compactly supported forms in the definition above, we obtain 
the related notion of the de Rham cohomology group with compact support H P R 0 (M). 

It is not difficult to show that the de Rham cohomology is a homotopy invariant 
and thus a topological invariant. This is quite astonishing considering that its defini¬ 
tion relies on the smooth structure of the manifold. Note that, if M is contractible, i.e., 
homotopy equivalent to a point, then all its de Rham cohomology groups vanish. 
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1.3.2 Integration 

A smooth n-form p, e J7 n (M) on a smooth n-dimensional manifold M is called a 
volume form if /i(x) 7 ^ 0 for all x € M. If such a volume form p exists, we say that 
M is orientable because p assigns a consistent orientation to all of M. Here, we 
say that a basis v 1 ,..., v n G T X M is positively (negatively) oriented at x with respect 
to to e fl n (M) if tu(x)(v 1 0 ... 0 v„R 0. If M is an orientable smooth manifold 
equipped with a metric tensor g, there exists a unique volume form p g , the g-volume, 
which satisfies 


MgOOOl 0 • • • 0 v n ) = yf\g\ = v / |det[g x (v i ,v j )]| 

for a positively oriented basis v 1 ,..., v n G T X M. 

Once integration over R n is defined, i. e ., given a domain U cR n 


r 

p p 

V = 

/ dx 1 A • • • A dx n = 

JU J 

u Jl 


f dx 1 • • • dx n 


for some / e C 00 (M' 1 ) such that v = f dx 1 A • • • A dx n , we can use the a partition of 
unity, local charts and linearity of the integral to extend the notion to general smooth 
manifold. That is, if to e 17 n (M) is a form of maximal degree on an orientable smooth 
manifold M which is compactly supported in the chart (JJ ,p), then 


- r 

CO = ± p*to, 

JM J 

where the sign depends on the orientation of the chart ([/, p) with respect to to. Thus 
we obtained a method of integration on manifolds that is invariant under orientation¬ 
preserving diffeomorphism invariant. 

Integrating a metric (•, •) on a vector bundle E over an orientable pseudo-Rieman- 
nian manifold (M, g), yields a natural inner product (•, •) on the sections of E\ Given 
/,fte -T(E), we define 


(f,h) 


(f,h)p g , 

Jm 


whenever the integral exists. 

Arguably the most important result on integration on manifolds is Stokes’ theorem 
- a generalization of the fundamental theorem of calculus. It states that the integral 
of an exact form do> e l7 n (M) over a relatively compact open subset U c M of a n- 
dimensional oriented manifold M is given by the integral of to over the C 1 -boundary 
8U: 

- r 

dd) = L*tO, 

JU J dU 

where t : d U < —* M denotes the inclusion map. Note that the classical theorems of 
Gauss (also called the divergence theorem) and Green are special cases of Stokes’ 
theorem. 
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1.3.3 Hodge star and codifferential 

Let (M, g) be an oriented pseudo-Riemannian n-dimensional manifold for the remain¬ 
der of this section. 

We can now introduce a smooth bundle homomorphism * : /\ P (T*M) —» 
/\" the Hodge star (operator); it is the unique bijection such that 

A*q = g(co,p)p, g 

for all to, p e 17 P (M). This implies the properties 

*l=H g , *p g = (-l) lnd(s \ **co = (-l) lnd ^ +p ^~ p) co. 

It also follows that we can rewrite the inner product (•, •) between differential forms 
induced by the metric as 

(to, rf) = to A * 17 . 

Jm 

for all to, 7) e 17 P (M) for which the integral is defined. 

The formal adjoint of the exterior derivative with respect to this pairing is the 
codifferential 5 : 17 P+1 (M) —■* 17 P (M): 

(to, 817 ) = (dm, 17 ) 


or, equivalently, 


5m = (-l) np+1+IndW *d*m. 


The codifferential is not a derivation and thus it does not satisfy the Leibniz rule. 
Note that for one-forms 17 the codifferential satisfies 5rj = — divp', i.e., it is equal to 
minus the divergence of the related vector field. 

Analogously to the case of the exterior derivative, a form co is called coclosed if 
5 co = 0 and coexact if co = 5 17 for some form 7). As a consequence of the bijectivity of 
the Hodge star one finds 


H 


n-p 

dR 


(M) = 


{me 17 P (M) | m coclosed} 
{me 17 P (M) | m coexact} ’ 


and, in particular, 


H P dR (M) = H 


n-p 

dR,0 


(M) 


if H p r (M) is finite-dimensional; the latter relation is a consequence of the Poincare 
duality theorem. 

A normally hyperbolic differential operator generalizing the usual d’Alembert 
operator, the Laplace-de Rham operator, can be obtained by composition of the 
codifferential and exterior derivative; we define it as 


□ = doS + Sod. 
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In abstract tensor notation the Laplace-de Rham operator acting on a (smooth) 
p-form co is given by the Weitzenbock type formula [145, Eq. (10.2)] 

(□ co )a 1 -a J) = ^b^ay-ap +^]^a m b CL) a 1 . a p + '^ i ^-a m ba n c co a l . a p - 

m m^n 

Consequently the Laplace-de Rham operator on scalar functions (0-forms), which we 
will also call the d’Alembert operator, is defined as 

□ = Sd = —V a V a , 

which differs from the definition in some publications of the author [3, 4] by a sign. 
1.3.4 Integral submanifolds 

While a vector field can always be locally integrated to give an integral curve, it is 
not true that to every rank k > 1 subbundle of the tangent bundle TM there exists a 
m-dimensional submanifold of a smooth manifold M. 

Let D <ZTM be a smooth rank-k subbundle, a plane field, and NcMan immersed 
submanifold. N is an integral manifold of D if 

T X N=D X 

for every x e N and we say that D is integrable. 

The plane field D is locally spanned by smooth vector fields v 1 ,...,v k , i.e., 
each local frame around a point x e M is given by the vector fields v ; such that 
v 1 1*,..., v k \ x is a basis of D x . Frobenius’ theorem states that D is integrable if and 
only if [v ; , Vj] x e D x for all i,j at every point x. 

A plane field D is equivalently locally specified in a neighbourhood U c M by a 
collection of covector fields ..., co n _ k such that 

D x = ker co 1 \ x n • • • n ker co n _ k \ x 

for every x e U. The dual to Frobenius theorem is then: D is integrable if and only if 
there exist smooth covector fields [rq j \ i, j = 1 ,..., n — k} such that 

n-k 

dco t =X>;A Vi,j, 

;'=i 

in other words, D is involutive. For example, a smooth covector field co specifies a 
codimension 1 integral manifold of M if and only if 

dco A co = 0. 

A foliation of dimension k of a smooth manifold M is a collection SF = (iV ; ) of 
disjoint, connected, (non-empty) immersed /c-dimcnsional submanifolds of M, the 
leaves of the foliation, such that their union is the entire manifold M and each point 
x e M has a chart (L7, tp) with coordinates (x 1; ..., x n ) such that for every leaf N t 
that intersects U the connected components of the image p{U fl Nf) are given by the 
equations x k+1 = const,..., x n = const. 
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If SF is a foliation of M, then the tangent spaces of the leaves form a plane field 
of M which is involutive. Conversely, the global Frobenius theorem states that the 
maximal integral manifolds of an involutive plane field D on M form a foliation of M. 


1.4 Bitensors 

Let (M, g) be a pseudo-Riemannian n-dimensional manifold for the remainder of 
this section. Henceforth we will use unprimed a,b,... and primed a', b',... abstract 
indices to distinguish between objects that transform like tensors at x and x' respec¬ 
tively. For example, a bitensor T e r{T^M [3 T®M) can be denoted by T ab c , ( x,x 
where relative position of primed and unprimed indices is arbitrary. Note that T ah c , 
transforms like a contravariant 2-tensor at x and as a covector at x'. While many of 
the operations presented in this section generalize to sections of arbitrary exterior 
tensor bundles, we limit the discussion to bitensors, i.e., sections of T^M Kl TfM, in 
favour of concreteness rather than generality. 

Taking covariant derivatives of bitensors, we further notice that derivatives with 
respect to x and x' commute with each other. That is, every (sufficiently regular) 
bitensor T(x,x') satisfies the identity 

T ;ab' = T ;b'a> 

where we have suppressed any other indices (we will do the same in the next two 
paragraphs). Partial derivatives commute as always. 

Often one is interested in the limiting behaviour x' —» x . This limit is called the 
coincidence or coinciding point limit and can be understood as a section of a tensor 
bundle of M whenever the limit exists and is independent of the path x' —» x. If a 
unique limit exists, we adopt Synge’s bracket notation [209, Chap. II.2] 

[T](x)= lim T(x,x'). 

x'—>x 

An important result on coincidence limits of bitensors is Synge’s rule: Let T(x,x') 
be a bitensor as above, then [179, Chap. 1.4.2] 

[T] ; a = [T ;a ] + [T-a'] or [T. a ,] = [T]. a -[T ;a ] (1.4) 

whenever the limits exist and are unique. The second equality is a useful tool to turn 
unprimed derivatives into primed ones and vice-versa. 


1.4.1 Synge’s world function 


In a geodesically convex neighbourhood U we can define the geodesic distance between 
two points x,x' e U as the arc length of the unique geodesic / joining x = y(t 0 ) and 
x' = /(ti). It is given by 


d(x,x0 


"h 

g r {t)(r,r) 1/2 dt = (d - tohxCi, f ) 1/2 


J c 0 

because the integrand is constant along the geodesic as a consequence of the geodesic 
equation. A slightly more useful function is Synge’s world function, introduced in 
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[186, 187, 208], which gives half the squared geodesic distance between two points 
(thus it is sometimes also called the half, squared geodesic distance ). Namely, 

cr(x, x') = |d(x, x'f = |(ti — t 0 ) 2 g x (f, f). (1.5) 

In terms of the exponential map the geodesic distance and the world function can be 
expressed as 

d(x, x) = g x ( exp“ 1 (x / ), exp^CxO) 1/2 
cr(x, x) = \g x ( exp^CxO, exp^C*')) • 

Note that geodesic distance d and Synge’s world function a are invariant under 
linear reparametrizations of the geodesic y. Furthermore, they are both examples of 
bitensors - in fact, biscalars. 

For the covariant derivatives of the world function it is common to write 


<J a 1 -a p b' 1 -b' q ^ ai 


V 


K 




i.e., we always omit the semicolon on the left-hand side. From (1.5) one can compute 
cr a (x,x') = (t 1 - t 0 )g ab r h and cr^^x^') = (t 0 - t^g^^y*, 


where the metric and the tangent vector are evaluated at x and x' respectively. 
Consequently we have [cr a ] = [cr a /] = 0. 

According to (1.5), the norm of these covectors is given by the fundamental 
relation 

<J a cr a = 2 <r = cr a /cr a . ( 1 - 6 ) 

Therefore, cr a and cr a / are nothing but tangent vectors at x and x' to the geodesic 
y with length equal to the geodesic distance between these points. Actually, (1.6) 
together with the initial conditions 

[cr] = 0 and [cr ab ] = [cr Q , b ,] = g ab (1.7) 

can be taken as the definition of the world function. Coincidence limits for higher 
derivatives of cr can be obtained by repeated differentiation of ( 1 . 6 ) in combination 
with Synge’s rule (1.4). In particular one finds [179] 

l>abcl = [cr ab y] = [cr abV ] = [<Vb'c'] = 0- (1-8) 

An important biscalar that can be constructed from the world function is the van 
Vleck-Morette determinant defined as [156] 

A(x,x') = sgn(detg x )(detg x detg x /) -1/2 det ( — a ab /(x,x / )) >0. (1.9) 

The van Vleck-Morette determinant expresses geodesic (de)focusing: A > 1 implies 
that geodesics near x and x' undergo focusing whereas A < 1 implies that these 
geodesics undergo defocusing [179]. 

From (1.9) one can derive the transport equation 

n = cr a ( ln/\) a + cr a a = cr a '(ln A) a / + cr a ' a ,. 


This, together with the initial condition [A] = 1, can also be used as an alternative 
definition of the van Vleck-Morette determinant. 
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1.4.2 Parallel propagator 

Another important biscalar is the parallel propagator. It does exactly what its name 
suggests: in a geodesically convex neighbourhood U it transports a vector of a fibre 


at a point x' e U to a vector at x e U along the unique geodesic y joining the two 


points. We will denote it by the same symbol as the metric. Here we define it only for 
the tangent bundle, where we write 



with v a e T X M and v a e T X >M. 


The parallel propagator satisfies the transport equation 



because cr a and cr a are tangent to the geodesic y at x and x' respectively. Together 
with the initial condition 



( 1 . 10 ) 


this equation may be taken as the definition of the parallel propagator. With the 
parallel propagator we can raise and lower the (un)primed index in g a b , with the 
usual metric and use it to transport (un) primed indices of tensors to their opposite. 

1.4.3 Covariant expansion 

It is possible to generalize Taylor’s series expansion method to bitensors (and thereby 
also to tensors). The covariant expansion was originally developed for ordinary 
tensors [186, 187] but can be easily extended to bitensors, see for example [179, 
Chap. 1.6]. 

While Taylor’s method is used to expand a function fix') around a point x in 
terms of powers of the distance x' — x with series coefficients given by the derivatives 
of / at x, the covariant expansion method replaces functions with bitensors, distance 
with geodesic distance as given by cr Q (x, x') and ordinary differentiation with the 
covariant derivative. That is, given a bitensor T a (x, x 7 ), where a = a 1 ■ ■ ■ a m is a list 
of unprimed indices, we perform the expansion 



( 1 . 11 ) 


k =0 


We can then solve for the expansion coefficients by repeated covariant differentiation 
and taking the coinciding point limit. Namely, it follows from (1.11) that 



etc. If T is a bitensor that also has primed indices, one first has to transport the primed 
indices to unprimed ones using the parallel propagator g a b ,- 

Of course, just like any other Taylor expansion, the covariant expansion of a 
smooth bitensor does in general not converge and if it converges it is not guaranteed 
to converge to the bitensor that is being expanded. Only if the bitensor and the 
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metric are analytic, we will not encounter these difficulties. However, even then 
the covariant expansion can be a very useful asymptotic expansion and we usually 
truncate it after a finite number of terms. Therefore, in following we will work with 
the covariant series in a formal way and whenever we write an infinite sum we 
implicitly mean that the series is to be truncated and a finite remainder term is to be 
added. 


1.4.4 Semi-recursive Avramidi method 


As described, e.g., in [69, 169], the ‘naive’ recursive approach to calculate the 
expansion coefficients of a covariant expansion, as briefly sketched above, is inefficient 
and does not scale well to higher orders in the expansion because the calculation 
of coincidence limits becomes computationally prohibitive. An alternative, non¬ 
recursive and elegant method for the calculation of these coefficient was proposed by 
Avramidi [21, 22], From the computational perspective, however, also this approach 
is not necessarily optimal as it does not always make good use of intermediate result 
leading to an algorithm which is space but not time efficient. A middle way, that we 
will present here, was implemented in [169] using a ‘semi-recursive’ method. 

Avramidi’s approach rests on the power series solution approach to solving dif¬ 
ferential equations. Therefore it can only be applied where the bitensor solves a 
differential equation, the transport equations. 

With the transport operators 

V CT = cr a V a and V; = (7 a 'Vy. (1.12) 


the world function (1.6) can be expressed as (V CT — 2)cr = 0 or (V^. — 2)cr = 0. One 
can construct additional transport equations by differentiating these equations and 
commuting covariant derivatives at the cost of introducing curvature tensors. In 
particular we find [169] 


\7> ra' _ ra' _ ra' re' _ nfl' 

v (jS y — S y S y S y - n - yyy 

V / CT A 1 / 2 = |A 1 / 2 (n-rlO, 


C7 c V', 


(1.13a) 

(1.13b) 


where we have defined E, a h , = cr a b , to avoid confusion later on. 

In the next step we take the formal covariant expansion of the bitensor and use 
the transport equation to find relations between the expansion coefficients. Let us 
again take a bitensor T a (x, x r ) with the expansion (1.11). Applying to it the transport 
operator V' , we obtain formally 


v a ux,x , )=Y J 

m=0 


00 


£ 

m =1 


(~D m 

m! 

(~D m 

m! 


mt ab 1 -b m (^)o' bl ---cr bm 3 


i.e., applying V' T to the m-th term is equivalent to multiplying it by m: 


(V / a T) (m) = kT (m) . 
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Contracting two bitensors Sq^x') and T b (x,x we further find 

00 f—i') m k fm\ 

Uqb — SacT b = ^ | ( h. ) s acd 1 -d k t bd k+1 -d m a 1 '' ' a m > 

m =0 ' k =0 k ' 

i.e., the m-th expansion coefficient of U is obtained from the lower order coefficients 
of S and T: 


U ab{m) 



Sac (fc) 


T 


b(m—k)‘ 


Moreover, we follow Avramidi and introduce 


3 <C a h( ,=R a f ... ,cr Cl • • • cr Cm 

b(m) (ci|b|c 2 ;c 3 -c m ) 


so that we can write the covariant expansion of R a ,,,,,cr c (j d as 


oo 

a a a b' T> a ’ rr c 'rr d ' — \ ' ^ °ir a 

& a'8b R c'b'd ,<J 2u( m — 2)< b(m)' 

m=2 v J ’ 

These formal manipulations and definitions can now be applied to find the 
covariant expansion coefficients of a bitensor. Here we will apply the method to E, a b , 
and A 1 ! 2 with their transport equations (1.13). It follows from (1.7) and (1.8) that 
SVo) = ^b an< ^ b(i) = Then, for m> 2, we can easily find the relation 


m—2 s x 

-(m + l)C bm = J] ? a c(fc) ? c b(m _ fc) + m{m - l)3C (m) . 

For the coefficients of the square-root of the van Vleck-Morette determinant one gets 
A j^ 2 = 1 from (1.7) and (1.10) and 





E a 

^ a (m—k) 


for m > 0. Equivalent relations were found in [169] and implemented in the 
Mathematica™ package CovariantSeries. 


1.4.5 Coordinate expansion of the world function 

Our aim in this section is to find an expansion of Synge’s world function cr in a 
coordinate neighbourhood. There are different ways to achieve this (see for example 
[83] for an approach using Riemannian normal coordinates). Here we use a (formal) 
power series Ansatz and write 


00 ^ 

a(x, xO = J] — ?Ml ... Mm (x)Sx^ • ■ • 5 x Mm , 

m =0 


where we denote by 5x M = (x 7 — x) M the coordinate separation of the points x,x' in 
a chart. 

The transport equation (1.6) can then be applied to obtain relations between the 
coefficients As consequence of (1.7) and [cr^] = 0, the first three 
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coefficients are 


? = °> = 0 and 


This can be used to derive the relation 


r"—° 



2(1 - 


\,V 



*^m—l»^m) 


(1.14) 


for m > 2 after a cumbersome but straightforward calculation. 

It is not difficult to implement (1.14) efficiently in a modern computer algebra 
system; it involves multiplication, transposition and symmetrization of multidimen¬ 
sional arrays and partial derivatives on the components of these arrays. Making use 
of the symmetry of the coefficients, can reduce computation time and memory usage 
for higher order coefficients, especially for ‘complicated’ metrics, significantly. Even 
if these symmetries are not used, this method appears to be more efficient than the 
method of [83], as it does not require to compute Riemann normal coordinates first. 
In fact, because the Riemann normal coordinates are given by the derivative of the 
world function, the coefficients can also be used to compute the expansion of 

Riemann normal coordinates: 

cr M (x, x) = g pv 5x v + d (? VPjM + ?pv P )5x v 5x p + ^(g vp x,ii + q llvp x)5x v 5x p 5x x + ■■■ 

Again, this method appears more direct and faster than the one described in [40] . 

For example, for the Friedmann-Lemaitre-Robertson-Walker metric in cosmologi¬ 
cal time (see Sect. 2.3.1 for more details), 


g = —dt ® dt + a(t) 2 5 i j dx ! <g> dx J , 


one finds with H = a/a 


2cr(x,x / ) = -5t 2 + a 2 5x 2 (l + H5t + |(H 2 + H)5t 2 + ±cl 2 H 2 5x 2 
+ +H)5t 3 + j^a 2 H(2H 2 +H)5t 5x 2 

+ T5x( — 4H 4 - 8 H 2 H + 2H 2 + 6 HH + 3 H)5t 4 

loU 

+ ^a 2 (48H 4 + 74H 2 H + 8 H 2 + 9 HH)5t 2 5x 2 
+ -!-a 4 H 2 (4H 2 + 3 H)) 8x 4 + 0(5x 7 ) 


in agreement with [83]. Above we denoted by a dot derivatives with respect to 
cosmological time t and defined the coordinate separation 5t = (x 7 — x)° and 
5x 2 = j(x' — x)\x' — x) } . Transforming to conformal time, so that 
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one finds with = a'/a 

2 a“ 2 cr(x, x 7 ) = —5t 2 + 5x 2 — W <5t 3 + W5t5x 2 — j^(776 2 + 4^05t 4 

+ |(3^f 2 + 2W')5t 2 5x 2 + ±p€ 2 5x 4 - j^{3W 3 + 5 WW' + W")5x 5 
+ ^(2^ 3 + 4 7€7€' + W")5x 3 5x 2 + ^7€{7f€ 2 + W')5t5x 4 
- ^(31^ 4 + lOlST 2 ^ + 28W' 2 + 39WW" + 6W"')5t 6 
+ ^(15^ 4 + 61W 2 W' + 20W' 2 + 303^" + 6W" / )5t 4 5x 2 + 

+ ^(15^ 4 + 3 77t 2 7t' + 87€ 12 + 9W7€")5t 2 5x 4 
+ ^(W 4 + 37€ 2 7fe')5x 6 + 0(5x 7 ), 

where we denoted by a prime derivatives with respect to conformal time t and 
defined 5t = (x' — x)° and <5x as above. 

The method described above is sufficiently fast to be applied to more complicated 
spacetimes such as Schwarzschild or Kerr spacetimes. Nevertheless, for such space- 
times the expansions become so long, that they easily fill more than a page if printed 
up to sixth order. Therefore we will omit them here and suggest that the interested 
reader implements (1.14) in a computer algebra system. 


Lorentzian geometry 



Summary 

Around the turn of the 20th century it became clear that several physical phenomena 
could not be satisfactorily described in the framework of Newtonian physics. After 
insights by Lorentz, Poincare and others, a solution to some of these problems was 
found and published by Einstein in 1905 and coined special relativity. In its modern 
form the changes introduced by Einstein can be seen to arise from to the union 
of space and time in a single spacetime. In the absence of gravity or in the weak 
gravitational limit our universe appears to be well described by Minkowski spacetime, 
its causal structure and its symmetries. 

The changes introduced by Minkwoski spacetime are generalized by the concept 
of Lorentzian spacetimes. This class of pseudo-Riemannian manifolds forms the basis 
of general relativity, introduced by Einstein in 1916, which takes the lessons learned 
from special relativity to formulate a geometric theory of gravitation. In Einstein’s 
theory of gravitation the geometrical background of the Universe is itself dynamical 
and accounts for gravity via Einstein’s equation. The total lack of static background 
with respect to which length and time measurements can be made, makes general 
relativity significantly more difficult than Newton’s classical theory of gravitation. 

Nevertheless, general relativity has made many predictions like gravitational 
redshift, gravitational lensing and, in a sense, the Big Bang, that have been confirmed 
with modern observations. It also forms the basis of modern cosmology which 
attempts to describe the evolution of the Universe on large scales from the time of 
the Big Bang until present times and predict its ‘fate’. 

This chapter will give a short introduction to the causal structure of Lorentzian 
spacetimes, to general relativity and to cosmology in three separate sections. The 
aim of the first section (Sect. 2.1) is the definition of globally hyperbolic spacetimes 
and Cauchy surfaces. Therefore it will define notions such as spacetimes, time 
functions, the causal past and future of a set, causality conditions, and the splitting of 
a spacetime into its temporal and spatial part. In the second section (Sect. 2.2) we 
give an overview of general relativity including the stress-energy tensor, the classical 
energy conditions, Einstein’s equation and its fluid formulation, and the special case 
of de Sitter spacetime. Topics related to theoretical cosmology will be addressed in the 
third section (Sect. 2.3), where we discuss Friedmann-Lemaitre-Robertson-Walker 
spacetimes and perturbations around it. 

As we are presenting standard material, we will again mostly refrain from giving 
proofs or even sketching them. Excellent references for this chapter are the books 
[27, 80, 167, 216]: While [167] and in particular [27] contain detailed discussions 
of causal structure of Lorentzian spacetimes, [216] gives an good overall account 
of general relativity. The recent book [80] is, as the title already suggests, mostly 
focused on cosmology, and may be taken as a reference for the last section. 
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2.1 Causality 

2.1.1 Causal structure 


The metric g of an arbitrary Lorentzian manifold (M, g) 

distinguishes three regions 

in the tangent space: a nonzero v e T X M 

is 



spacelike 

if 

g* 0 ,v) > 

0 , 

lightlike or null 

if 

SxO>v) = 

0 , 

timelike 

if 

g* 0 ,v) < 

0 , 

causal 

if 

g* 0 ,v) < 

0 . 


Furthermore, we define the zero vector v = 0 as spacelike, as e.g. in [167]. 

Given a time-orientation u on M, viz., a smooth unit timelike vector field such 
that g(u, u) = —1, a causal vector v e T X M is 

future-directed, if g x (u(x), v) < 0 , 
past-directed if g x (u(x), v) > 0 . 

One often chooses the time-orientation to be the velocity of a physical fluid. Not every 
Lorentzian manifold can be given a time-orientation and we say that a Lorentzian 
manifold is time-orientable is such a vector field exists. 

All these notions extend naturally to the cotangent bundle T*M, sections of both 
TM and T*M and curves. That is, as seen in Fig. 2.1, to each point x e M we can 
draw a double cone inside M whose surface is generated by the lightlike curves 
passing through x; points lying inside the double cone are lightlike to x, while points 
lying outside are spacelike to x. 

A continuous function t : M —» R that is strictly increasing along every future- 
directed causal curve is called a time function. Clearly, if t is a time function, then its 
gradient yields a time-orientation which is orthogonal to the level surfaces 

f 1 (c)={xeM|/(x) = c}, cel. 

These level surfaces give a foliation of the manifold such that each leaf is spacelike 
and intersected at most once by every causal curve. 

The chronological future I + (U) (chronological past /“([/)) of a subset U <z M is 
defined as the set of points which can be reached from U by future-directed (past- 
directed) timelike curves. Similarly, the causal future J + (U) (causal past J~ ([/)) of 
a subset U c M is defined as the set of points which can be reached from U by 
future-directed (past-directed) causal curves; their union J(U ) = J + (U ) U J“(L7) is 
the causal shadow of U. If the set U consists of only one point U = {x}, we write 
/ ± (x) and J ± (x). Note that / ± (x) is always open, whereas J ± (x) is not necessarily 
closed . 1 We say that two subsets U and V are causally separated (in symbols: i/XV) 
if U D J(V) = 0. These definitions are illustrated in Fig. 2.2. 

A world line or observer is a timelike future-directed curve t >-> y(r) such that 
g(t, y) = — 1 and the curve parameter t is called the proper time of y. The observer y 


1 In globally hyperbolic spacetimes (to be defined below) J ± (x) is always closed. 
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Figure 2.1. Lightcone of a point in a three-dimensional Minkowski spacetime. 

is said to be freely falling, i.e., moving only under the influence of gravity, if j is a 
geodesic. 

The causal structure of a Lorentzian manifold is physically significant because it 
is used to imply that two events, i.e., two points on the manifold, can only influence 
each other if one is in the lightcone of the other. In other words, the Lorentzian causal 
structure encodes the finiteness of signalling speeds. This is distinctively different 
from Newtonian physics, where one event at an absolute time t 0 influences all other 
events at a later time t > t 0 , no matter the spatial separation. 

2.1.2 Covariant splitting 

Let (M, g) be a (1 + n)-dimensional Lorentzian manifold with a time-orientation u. 
Observe that the integral curves of u can be understood to define a preferred direction 
of motion. It is clear that the integral curves of u can be parametrized such that they 
are world lines. A local frame such that the components of u are 

u^ = (l,0 ,...,0) 


is called a comoving frame. 

Orthogonal to u at each point are the rest spaces of the associated observer. An 
induced Riemannian metric tensor for these n-spaces is given by the projected tensor 

h = g + u b 0 u b , 

which has the following properties: h ab u b = 0, h a c h c b = h b , h a a = n. 

By means of the projection h and the time-orientation u, we can decompose any 
tensor into its temporal and spatial parts with respect to the observer. In particular, 
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Figure 2.2. Causal past and future of a set and a causally separated set. 


we can define the temporal and spatial covariant derivatives of any tensor S a "' b by 

S a - b ... = u c V c S a - b ..., 

v.sv= i y"‘i'"'W 

Moreover, define the projected symmetric trace-free (PSTF) parts of a vector field v a 
and a tensor S ab by 

v (a> = h\v b , S {ah) = ( \ a c h b) d - lh ab h cd )S cd . 

The kinematics of the world lines given by u is characterized by V b u a which can 
be decomposed as 

^ b^a ^b^a ^a^b ^ab d” ®ab ^a^b ^ab d“ ®ab d” ^®^ab ^a^b> (2.1) 

with 

^ab ^[b^a]> ^ ab ^(a^b)> ® ® a ^> ®ab ^(a^b)> 


where co ab denotes the vorticity (twist) tensor, O ab the expansion tensor, 0 the 
(volume) expansion scalar, cr ab the shear tensor, and ii a the acceleration. The expansion 
scalar measures the separation of neighbouring observers and can be used to introduce 
a length scale a via the definition 

d ^ 0 
a n 


Furthermore, we shall define the magnitudes co and a of the vorticity and the shear 
tensor by 


. ,2 -i. 1 , , .ab 

CO — -CO ab CO , 


a 


l°abV ab - 


Using the definition of the Ricci tensor, we find 2V[ a V b] u a iz b =R ab u a u b which, 
together with (2.1), yields an equation describing the dynamics of world line 

-R ab u a u b = 0 + b© 2 + 2(a 2 - co 2 ) - u a . a , (2.2) 

where R ab u a u b is sometimes called the Raychaudhuri scalar. 

If the vorticity tensor vanishes, then, by Frobenius’ theorem, u will be orthogonal 
to a foliation of the spacetime by n-dimensional Euclidean hypersurfaces T, whose 
metric is given by (the pullback of) h. If the manifold is also simply connected so 
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(a) 



(b) 


Figure 2.3. Sets on the two-dimensional Einstein cylinder: (a) is causally convex, 
while (b) is not, as illustrated by the same causal curve in both figures. 


that its first de Rham cohomology group vanishes, then there exists even a time 
function. J Furthermore, in the case of vanishing vorticity, V can be directly identified 
with the Levi-Civita connection on the spatial slices £ with metric tensor h. Therefore 
it can be used to derive relations between the Riemann curvature R abcd of (£,h) 
and the Riemann curvature R abct j of the whole manifold (M,g). The Gauss-Codacci 
equation [216, Chap. 10] is 

R abcd = h a p h b q h c r h d s R pqrs - 20 a [ c 0 d ] b , (2.3) 

where we have expressed the extrinsic curvature via by the expansion tensor. Upon 
contraction with the projection h the spatial Riemann tensor leads to the projected 
Ricci tensor and Ricci scalar. 


2.1.3 Cauchy surfaces 

Although many results hold in greater generality, for physical reasons we will from 
now restrict our attention to a subclass of four-dimensional Lorentzian manifolds: 

Definition 2.1. A spacetime is a connected, oriented (positively or negatively oriented 
with respect to the volume form induced by the metric) and time-oriented, four-dimen¬ 
sional smooth 2 3 Lorentzian manifold (M,g,±, u). Usually we omit orientation and 
time-orientation and identify a spacetime with the underlying Lorentzian manifold 
(M, g); sometimes, when no confusion can arise, we will even drop the metric and say 
that M is a spacetime. 

Not all spacetimes are of physical significance because they admit features like 
closed timelike curves (i.e., ‘time machines’) that are usually considered unphysical. 
A spacetime that does not have any closed timelike curves (i.e., x ^ / + (x) for all 
x e M) is said to satisfy the chronological condition; no compact spacetime satisfies 
the chronological condition. A slightly stronger notion is the causality condition, 
which forbids the existence of closed causal curves. 

Given an open set U c M, it is called causally convex if the intersection of any 
causal curve with U is a connected set (possibly the empty set). That is, a causal 

2 A local time function will always exist if the vorticity is vanishing because every manifold is locally 
contractible. 

3 On some occasions we will work with spacetimes whose metric is not smooth. In all these cases 
we will mention the regularity of the metric explicitly. 
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curve intersecting U cannot leave the set and enter it again (Fig. 2.3). If for every 
neighbourhood V of x e M there is a causally convex neighbourhood U c V of x, 
then the strong causality condition holds at x. A spacetime (M, g) is called strongly 
causal if it is strongly causal at every point. Finally, if (M, g) is strongly causal and 
J + (x) n J~(y) is compact for every pair tjeill, it is called globally hyperbolic. 

A Cauchy surface is a subset £ c M which is intersected exactly once by every 
inextendible timelike curve (and at least once by every inextendible causal curve). 
Therefore, the causal shadow of a Cauchy surface is the entire spacetime. Note that a 
Cauchy surface may be non-spacelike and non-smooth. Our ability to pose a Cauchy 
problem on a spacetime requires the existence of Cauchy surfaces. 

The existence of a Cauchy surface imposes strong conditions on the causality of 
the spacetime. Given a spacetime, the following statements are equivalent [35, 167]: 
the spacetime 

(a) is globally hyperbolic, 

(b) admits a Cauchy surface, 

(c) admits a smooth time function compatible with the time-orientation. 

Therefore we will be mostly interested in globally hyperbolic spacetimes in the 
following. Every globally hyperbolic spacetime (M, g) is diffeomorphic to I x g, 
where £ is diffeomorphic to a smooth spacelike Cauchy surface of M [34]. If we 
denote by t the time function on M, then the level sets of t are isometric to (27, g t ), 
where g t is a Riemannian metric on £ depending smoothly on t. In fact, (M, g) is 
isometric to R x 27 with the metric 

—fi dt ® dt + g t , 

where /3 is a smooth function on M. 

2.2 General relativity 

Throughout this section let (M, g, ±, iz) be an arbitrary spacetime, unless otherwise 
specified. 

2.2.1 The stress-energy tensor 

General relativity describes the interaction of classical matter with the geometrical 
structure of the Universe. Giving a precise definition of matter is difficult if not 
impossible. Mathematically matter is described in general relativity as a covariant or 
contravariant symmetric 2-tensor field T ab or T ab , the energy-momentum or stress- 
energy tensor which is covariantly conserved 

V b T ab = 0. 

The physical content of the stress-energy tensor T ab becomes clearer once we 
perform a covariant splitting relative to u and decompose T ab as 

Tab = pu a u b +PKb + 2q (a u b ) + n ab , 


(2.4) 
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with 

p = T ab u a u b , p = lT ab h ab , q a = ~T {a)b u b , n ab = T {ab) , (2.5) 

where p denotes the energy density, p the pressure, q c the momentum density (viz., 
the dissipation relative to u a ) and n ab the anisotropic stress. By definition of these 
quantities, the trace of the stress-energy tensor is given by 

T = T a a = -p + 3p. 

Instead of working with the stress-energy tensor it is thus possible to work with 
these four quantities and the equations of state relating them. For example, in this 
decomposition the conservation equation T ab . b = 0 splits into the energy conservation 
and the momentum conservation equation 

p+(p+ p)0 + n ab cr ab + V a q a + 2 u a q a = 0, (2.6a) 

V a P + (P + P)u a + n ab u b +V n ab + q a + ^0q a +(cr ab +co ab )q b = 0, (2.6b) 

which are the familiar equations for an inertial observer on Minkowski spacetime. 

If the anisotropic terms in (2.4) vanish (q a = 0 = rr ab ), the stress-energy takes 
the perfect fluid form 

Tab = pu a Ub+ph ab = (p +p)u a u b +pg ab (2.7) 

and the conservation equations (2.6) reduce to 

p + (p + p)0 = 0, (2.8a) 

V a P + (p+p)u a = 0. (2.8b) 

One further distinguishes between different forms of the general equation of state 
p = p(p,s ), where s is the medium’s specific entropy. Namely, if p = p(p), one speaks 
of a barotropic fluid, and if p = 0, we have pressure-free matter, also called ‘dust’. 

While general relativity imposes no a priori constraints on the form of the matter 
content, many possibilities can be considered unphysical in classical physics. The 
most common energy conditions are: 

(NEC) null energy condition 

T ab v a v b > 0 for all lightlike v a , 

i.e., no negative energy densities along any lightray; 

(WEC) weak energy condition 

T ab v a v b > 0 for all timelike v a , 

i.e., no observer detects negative energy densities; 

(DEC) dominant energy condition 

T ab v a w b > 0 for all future-pointing timelike v a , w b , 


i.e., the stress-energy flux is causal; 
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(SEC) strong energy condition 

T ab v a v b - \T > 0 for all unit timelike v a , 

i.e., gravity is attractive if A = 0, see (2.13). 

By continuity, WEC implies NEC and it is also not difficult to see that DEC implies 
WEC. Moreover, the SEC does not imply the WEC but only the NEC. The reverse 
implications are generally not true. Note that the strong energy condition is too 
strong for many physically relevant scenarios. 

A generic feature of quantum field theory is that none of the energy conditions 
above will hold, even in an averaged sense, because of the Reeh-Schlieder theorem 
[110, 192]. Instead one finds lower bounds on the averaged energy density, called 
quantum energy inequalities, see [87, 88] for a review of the subject. 

2.2.2 Einstein’s equation 

Einstein’s equation with a cosmological constant are 

1 . 8 tiG 

R ab - 7t R gdb + Agab = G ab + Ag ah = — ^~T ab , (2.9) 

2 c 4 

where G ab is called the Einstein (curvature) tensor and A the cosmological constant. 
The constants on the right-hand side are Newton’s gravitational constant G and the 
speed of light c; we shall always choose units such that 8 ttG = c = 1. Often we will 
also work with the trace of (2.9): 

-R + 4A = T. (2.10) 

Sometimes one absorbs the cosmological constant into the stress-energy tensor to 
emphasize its non-geometric nature. 

Note that Einstein’s tensor is covariantly conserved and symmetric, i.e., 

V b G ab = 0 and G ab = G^ ab - >, (2.11) 

so that the left-hand side of (2.9) is consistent with the right-hand side and gives a 
second-order differential equation in the metric. In fact, we can derive (2.9) from 
the assumption that the stress-energy tensor of a matter field should be the source of 
a gravitational potential (the metric tensor) in a second-order differential equation. 
Since the stress-energy tensor is conserved and symmetric, (2.9) is the only possibility. 

Combining (2.9) with (2.10), Einstein’s equations can be recast into the equivalent 
form R ab — Ag ab = T ab — -/fg ab - Together with the imperfect fluid form (2.4) of the 
stress-energy tensor this equation yields 

R ab - Agab = |(P + 3 p)u a u b + \(p - p)h ab + 2q {a u b) + n ab . 

Contractions with the time-orientation u a and the associated projector h ab to the 
orthogonal surfaces then give the three equation 

Kb^^ = |(p +3p) - A, 

R bch b u c = —q a , 

R cdK C}l b d = \(.P - P^ h ab + Ah ab + n ab . 


( 2 . 12 a) 

( 2 . 12 b) 

( 2 . 12 c) 
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The Raychaudhuri scalar (2.2) attains the physical meaning of the active grav¬ 
itational energy by (2.12a) and thus we can state the Raychaudhuri equation as: 

0 + \Q 2 + 2(cr 2 - co 2 ) - V a ii a - u a ii a + |(p + 3p)-A = 0. (2.13) 

Hence we see that expansion, shear and matter (satisfying the strong energy con¬ 
dition) promote gravitational collapse, whereas a positive cosmological constant, 
vorticity and positive acceleration (due to non-gravitational forces inside the medium) 
oppose gravitational collapse. One can also derive differential equations for the shear 
and the vorticity [212], 

2.2.3 De Sitter spacetime 

A Lorentzian manifold (M, g) is called a vacuum solutions to Einstein’s equation (2.9) 
if both T ab and A vanish globally. That is, such a solution satisfies 

R ab = 0- 

The vacuum Einstein equation is the most studied special case of the Einstein equation 
and many important and instructive examples fall into this class. The most basic 
solution is Minkowski spacetime (M, rj), see (1.2), which describes a featureless 
empty universe. A rotation-symmetric vacuum solution is given by the famous 
Schwarzschild solution. 

If we also allow for a cosmological constant but keep T ab = 0, we need to solve 

R ab Agab■ 

The maximally symmetric solutions of this equation fall into three classes depending 
on the sign of A. We focus here on the case A > 0 called de Sitter spacetime; for A = 0 
one obtains Minkowski spacetime and for A < 0 the so-called anti-de Sitter spacetime. 

Four-dimensional de Sitter spacetime is the hyperboloidal submanifold of five¬ 
dimensional Minkowski spacetime with coordinates (y°, y 1 ,.y 4 ) that satisfies the 
equation 

-(J°) 2 + (J 1 ) 2 + - + (J 4 ) 2 =H- 2 , A = 3H 2 , (2.14) 

where H > 0 is called the Hubble constant; thus de Sitter space is topologically IxS 3 . 
The pullback of the Minkowski metric to this space yields a Lorentzian metric. 

A global coordinate chart with coordinates (t ,/, 6, ip) can be defined by 

y° = H _1 sinh(Ht), / = H _1 cosh(Ht)z', 

where z l = z l (%, 0, ip), i = 1,2,3,4, are the usual spherical coordinates on S 3 with 
unit radius. In these coordinates the induced metric on de Sitter spacetime reads 

g = —dt (g> dt + H~ 2 cosh(Ht)(d^ <g> dy + sin 2 / (d0 <g> d0 + sin 2 0 d ip <g> dip)) 

= —dt ®dt +H _2 cosh(Ht)g S 3, 

where g S 3 denotes the standard metric on the 3-sphere. It is clear, that de Sitter 
spacetime is globally hyperbolic and the constant time hypersurfaces are Cauchy 
surfaces. 
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Another coordinate chart with coordinates (t, x 1 , x 2 , x 3 ) is given by 

y° = H _1 sinh(Ht) + \He m r 2 , 
y l = e Ht x i , 

y 4 = H _1 cosh(Ht) - |He Ht r 2 , 

where i = 1,2,3 and r 2 = (x 1 ) 2 + (x 2 ) 2 + (x 3 ) 2 . It covers only the half of de 
Sitter spacetime which satisfies y 0 + y 4 > 0 and is called the cosmological patch or 
cosmological chart; it is diffeomorphic to four-dimensional Euclidean space. Within 
the cosmological chart, the metric reads 

g = —dt 8> dt + e 2Ht 5 i j dx l (8) dx ; = (Ht) - 2 (—dr 8> dT 4- 5 i; - dx ! 8) dx ; ), 


where we define the conformal time t e (0, oo] via 


t 


T(t) = - 


afV) -1 dt'. 


Note that this metric is a special case of a Friedmann-Lemaitre-Robertson-Walker 
metric, to be discussed in more generality in Sect. 2.3. 

On de Sitter spacetime, Synge’s world function is known in closed form and is 
closely related to the geodesic distance on five-dimensional Minkowski spacetime. In 
fact, since the chord length between two points x, x' on de Sitter space considered as 
the hyperboloid (2.14) is 


Z(x, x') = H 2 r) al3 y a (x)y l3 (x / ), (2.15) 

Synge’s world function on de Sitter spacetime is given by 

cos (H \J 2ct(x, x 7 )) = Z(x, x') (2.16) 


for |Z| < 1, i.e., for x' are not timelike to x. Equation (2.16) can be analytically 
continued to timelike separated points x,x', whence we find 

cosh (H\J— 2cr(x, x 7 )) =Z(x, x') for |Z| > 1. 


The possible values for the chord length Z are illustrated in a conformal diagram of 
de Sitter spacetime in Fig. 2.4. In the cosmological chart, the function Z attains the 
simple form 


■M2 


Z(x, X 7 ) = 14- 


(t -» T ) - ( X -xQ 
2't't' 


r'2 


■ (x - X ) 


•M2 


2tt' 


(2.17) 


where x = (t, x) and x' = (t / , x'). Note that the fraction on the right-hand side is a 
rescaling of Synge’s world function on Minkowski spacetime by — 2tt / . 


2.3 Cosmology 

When studying cosmological problems on usually describes the Universe by a homoge¬ 
neous and isotropic spacetime, t.e., one assumes the Friedmann-Lemaitre-Robertson- 
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Figure 2.4. De Sitter spacetime conformally mapped to a cylinder. The cylinder is 
unwrapped and the left and right dotted edges must be identified, x' is any (fixed) 
point of de Sitter spacetime and Z(x,x') is shown for all choices of x. The dotted 
line represents Z = 0. See also [10, Fig. 2]. 

Walker model. This standard prescription underpins the so called standard model 
of cosmology, the ACDM model of cold dark matter with a cosmological constant. 
However, from the presence of structure in the Universe (e.g., galaxy clusters, galax¬ 
ies, stars, etc.) we can directly deduce that the Universe is neither homogeneous 
nor isotropic. Instead it is believed that one can describe the Universe as nearly 
homogeneous and isotropic on cosmological scales so that observed Universe can be 
modelled as a perturbation around a FLRW spacetime and homogeneity and isotropy 
hold in an averaged sense. 

The cosmic microwave background (CMB) and the galaxy distribution are often 
believed to give a direct justification of this idea, but since all observations are 
along the past light cone and do not measure an instantaneous spatial surface one 
can only directly observe isotropy. The link to homogeneity is less clear but if all 
observers measure an isotropic CMB, it can be shown that the spacetime is also 
homogeneous. In this respect, the current cosmological model is heavily influenced 
by the philosophical paradigm in cosmology, the Copernican principle, that we live in 
no distinguished region of the Universe and that other observers would observe the 
same. But even under this additional assumption it is not completely clear that near 
homogeneity follows since the CMB is not exactly but only nearly isotropic; see [78, 
80, 148, 205] for a discussion of this issue. 

Ignoring these shortcomings, we follow the standard approach and assume that 
we live in an ‘almost’-FLRW spacetime, i.e., a universe which is correctly described 
by a perturbation around a FLRW background. Because such an assignment of a 
background spacetime to the physical perturbed spacetime is not unique, one has to 
deal with a gauge problem. 

Accordingly, we will begin by introducing the FLRW model and its properties. 
Then we will study the general gauge problem and its application in the case of a 
perturbation around a FLRW spacetime. 

2.3.1 Friedmann-Lemaltre-Robertson-Walker spacetimes 

One of the simplest solutions of the Einstein equation is the Friedmann-Lemaitre- 
Robertson-Walker (FLRW) solution, which describes a homogeneous and isotropic, 
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expanding or contracting universe. Let (M, g) be a spacetime, which is still to be 
determined, with a preferred flow (time-orientation) u. As a consequence of (spatial) 
isotropy, i.e., the absence of any preferred (spatial) direction, vorticity, shear and 
acceleration have to vanish: 


^ab 6, tJ a j, 0, U a 0. 

Also the anisotropic terms of the stress-energy tensor have to vanish, i.e., 

9a 6, a b 6 

so that the stress-energy tensor takes the perfect fluid form (2.7). Consequently the 
FLRW model is completely determined by its energy density p, pressure p and the 
expansion 0. 

Since vorticity and acceleration vanish, the spacetime (M, g) is foliated by surfaces 
27 orthogonal to u, which are required to be homogeneous by assumption and thus 
p, p and 0 are constant on these surfaces. For the same reason there exists locally 
a time function t, called cosmological time, that measures proper time, defined up 
to a constant shift, such that u a = — V a t. Henceforth we will always assume the 
time function exists globally so that the resulting spacetime is stably causal and, in 
particular, globally hyperbolic. 

One can show that the projected Ricci tensor, i.e., the Ricci tensor for the spatial 
slices 27 (c/ (2.3)), simplifies significantly to 

R ab = I Rhab = I(p + 71 - |0 2 )h ab . 

We may recast this equation into the more familiar form of the first Friedmann 
equation 

H 2 = 1 -(p + A- 1 -R). (2.18) 

with the famous Hubble parameter or Hubble function H = 0/3 = a/a. The second 
Friedmann equation is a special case of the Raychaudhuri equation (2.13) and reads 

H + H 2 + |(p + 3p) — | A = 0. (2.19) 

These two equations can be complemented with the energy conservation equation for 
the perfect fluid (2.8a) to show that Ra 2 is a constant. 

We define K = Ra 2 / 3 and notice that the initial value for the scale factor a > 0 is 
arbitrary so that we can restrict its value to K = — 1,0, +1. The sign of K determines 
the local geometry of the spatial sections: K = — 1,0, +1 correspond respectively to a 
hyperbolic, a flat and a elliptic geometry. The topology of the spatial sections is not 
completely determined by K and, in fact, there are many possibilities. While K = +1 
implies that the spatial sections are compact, both compact and non-compact spatial 
section are possible for K = — 1,0. 

Which of these three distinct values for K is realized depends on the energy 
density contained in the universe. If the energy density takes on the critical value 
p = p c = 3 H 2 — A, the spatial surfaces will be flat, while p > p c leads to a spherical 
and p < p c leads to a hyperbolical geometry. Furthermore, according to (2.19), an 
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accelerating expansion (a > 0) occurs when p + 3p < 0 (assuming A = 0), i.e., when 
the strong energy condition is violated. 

A metric tensor realizing the FLRW universe in a local comoving coordinate frame 
x M = (t, r, 6, 0) with respect to u a (i.e., u = d t ) is given (locally) by the FLRW metric 


g = —dt (g> dt + a(t) 2 (dr <8> dr + fM) 2 (d@ <8> d9 + sin 2 6 d0 <g> d0)), (2.20) 


where 


( . 
sinr 

for K = +1 

r 

for K = 0 

sinh r 

for K = — 1 


Mr) = - 


Therefore, a spacetime is a FLRW spacetime if and only if its metric attains locally the 
form (2.20) in some coordinate system and the time-orientation is given by dt. 

Let us define conformal time r via dT = d t/a. That is we set 


r t 


= T 0 - 


rh 


to a ^ 


— dt 7 or t (t) = 


a(t0 


— dt' — Ti 


with arbitrary t 0 ,t 1 and (possibly infinite) t 0 , t 1 such that the integral converges. 
Rewriting (2.20) with respect to conformal time, we obtain the alternative metric 
tensor, the conformal FLRW metric, 


g = cz(t) 2 ( — dr O dT + dr <8> dr + /x( r ) 2 (d0 <8> d0 + sin 2 9 d0 <g> d0)) (2.21) 


and thus we notice that a flat FLRW universe is locally conformally isometric to 
Minkowski space. 

Throughout this thesis we mostly work with flat FLRW universes and will fix 
M ~ M 4 in that case. Therefore we can choose globally Cartesian coordinates for the 
spatial sections so that we have 

g = —dt <8> dt + a(t) 2 5 i; - dx' <8> dx ; = a(r) 2 ( — dr ® dz + 5 I; - dx l (g> dx ; ), 


where the coordinate functions x l range over the entire real line. 


2.3.2 Gauge problem 

The correspondence of a background spacetime (M, g 0 ) to the physical spacetime (M, g) 
is equivalent to the specification of a diffeomorphism xp : M —* M. Given any other 
diffeomorphism xp' between the background spacetime and the physical spacetime, 
we can construct Lp = xp~ l o xp'. For a tensor field S on M to be gauge invariant we 
require that tp*S = S. Then it holds that 

5S = S — xp*S = S — xp'*S 

and we say that the perturbation 5S is a gauge invariant quantity. Otherwise, the 
perturbation 5S is completely dependent on the mapping xp and even if xp is specified 
it will not be an observable quantity unless the correspondence xp itself has been 
specified via an observational procedure (e.g. via an averaging approach, see the 
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discussion in [79]). Therefore the only possibilities for a tensor field 5S to be gauge 
invariant is that S is a constant scalar, a zero tensor or a product of Kronecker 
deltas [204, Lem. 2.2]). 

A slightly different picture which clarifies the perturbation aspect of the gauge 
problem is sometimes more helpful. Since the local flow of any vector field v on M is 
a diffeomorphism ip e : M —* M, where e is contained in a sufficiently small interval 
around 0, we can describe the gauge problem alternatively in terms of vector fields. 
Given a (differentiable) tensor field S, we have by definition 

2 %S = lim -(t b*S - S) 

£ *o e 


or, in other words, 

xp* E S = S + s% v S + 0(e 2 ). 

The gauge choice is now encoded in v, which is completely arbitrary, and we see that 
S is gauge invariant to first order if only if SC V S = 0 for all v. For S to be exactly gauge 
invariant (as discussed above) it must hold that 2?"S = 0 for all n. 

In the light of this discussion, two approaches to perturbations of FLRW spacetimes 
seem expedient: Since any quantity describing the inhomogeneity or anisotropy of 
the perturbation of the FLRW spacetime must vanish on the background, it will be 
gauge-invariant. This leads to the “1 4- 3 covariant and gauge-invariant” approach 
of [79, 119, 147]. The alternative and more commonly used approach due to [25, 
140] constructs gauge invariant quantities directly from the perturbed metric and 
stress-energy tensor. 

2.3.3 Decomposition of tensor fields 

Before we can discuss metric perturbations, we need to investigate the decomposition 
of vector and rank-2 tensor fields into their ‘scalar’, ‘vector’ and ‘tensor’ parts [203], 
Consider a non-compact, 4 boundaryless, orientable 3-dimensional Riemannian 
manifold (27, y) with covariant derivative denoted by a vertical bar (e.g., 4 >|;). Using 
the Hodge decomposition theorem, we can uniquely decompose any sufficiently fast 
decaying smooth one-form B as 

Bi = 4>\i + Si, (2.22) 

where cf) is a scalar function and S ; is divergence-free. 

Any (0,2) tensor field C can be decomposed as 

Q j = l^ijT k>c kl + c [ij ] + c (ij)> 

i. e. , into its trace, its antisymmetric part (equivalent to a vector field via the Hodge 
operator *) and its trace-free symmetric part C^jy According to [53, Thm. 4.3], as a 
consequence of the Fredholm alternative and an application of (2.22), a sufficiently 
fast decaying C can be further uniquely decomposed so that 

c (ij) = 4>\(ij) + s (i|j) + hij, (2.23) 

where 0 and S are as before and h is a trace- and divergence-free (0,2) tensor field. 


4 The decomposition works also for compact manifold but is non-unique in that case. 
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2.3.4 Metric perturbations 

As background spacetime we take (M, g 0 ) with g 0 the flat FLRW metric in conformal 
time, i.e., 

g 0 = a 2 (—dx 8> dr + 5 i; - dx 1 8> dx J ), 


which will be used throughout this section to raise and lower indices. Then we define 
the perturbed FLRW metric 

a~ 2 g = —(1 + 2<h)dT dT + B, (dx 1 8 > dr + dr 8 > dx 1 ) 

‘ . (2.24) 

+ ((1 - 2t/))5 i; - + 2Cij) dx ! (8) dx J 

where the scalar fields </>, ip, the 3-vector field B and the trace-free 3-tensor field C 
are considered ‘small’, i.e., (2.24) should be understand as a 1-parameter family of 
metrics g e and each of the perturbation variables as multiplied with a small parameter 
e. However, to avoid cluttering the equations unnecessarily, one usually omits the e. 

The ten degrees of freedom encoded in these four quantities exhibit the full gauge 
dependence. Before studying the behaviour of g under gauge transformations, let us 
rewrite (2.24) using the decompositions (2.22) and (2.23): 

B i = B ,i ~ s i> c ij = E ,ij + E {i,j) + \ E ij 


with two scalar fields E,B, two divergence-free 3-vector fields S, F and a trace-free, 
transverse 3-tensor field h. Therefore, 

a~ 2 g = — (1 + 2(b) dx 8) dT + ( B .- — S.) (dx ! 8) dT + dT 8> dx ! ) + 

’ . (2.25) 

+ ((1 - 2ip)5ij + 2E i j + 2F(ij) + h^) dx 1 8) dxh 

This decomposition allows us to consider three types of perturbations separately, 
namely, the scalar perturbations caused by c p,ip,E,B , the vector perturbations due to 
S, F and the tensor perturbations caused by h. The inverse of the perturbed metric 
(2.24) up to first order is 

a 2 g ~ 1 = -(1 - 20) 8) d T +B l (3 ; 8) 3 T + d T 8) d t ) + ((1 + 2t p)5 l i - 2 C !J ) d t 8) dp 


Let us now determine the transformation behaviour of these perturbation variables 
by calculating the gauge dependence of g up to linear order, i.e., the dependence 
= 2V (a ^ b) = 2°V (a ^ b) + 0(e) on ^ = (£°, E,’ 1 + ? ! ) with = 0, where 
°V is the covariant derivative on the background spacetime. Hence we have the 
transformations: 


Soo -* Soo ~ 2a 2 (^ 0/ + 5f^°), 

sot^g o i +a 2 (?; i + ?;-? 0 ! ), 

Sij Sij + 2a2 (? ; (ij) + + WSijt; 0 ), 


where we define the conformal Hubble parameter IK = aH = a'/a and a prime denotes 
a derivative with respect to the conformal time (e.g., a' = d z a). It follows that the 
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perturbation variables transform as 

E^E + S 

F i ^F i + Si 


(P^(P+S 0/ + WS° 
B^B + S'-S 0 
St^s.-s' 


and hij —> hij. 

Constructing linear combinations of the perturbation variables, we can now 
construct several first-order gauge invariant quantities. Two simple (first-order) 
gauge-invariant functions characterizing the scalar perturbations are the Bardeen 
potentials <P, \B [25] 


<& = (p —Wet — a', (2.26a) 

V^xP+Wcr (2.26b) 


in terms of the shear potential cr = E' — B. 



Analysis 

Summary 

The contents of the present chapter may be generously subsumed under “analysis”, 
hence the title. We begin with a review of topological vector spaces (Sect. 3.1) 
starting from the basics of topology and locally convex topological spaces with the 
particular example of function spaces, and finishing with a discussion duality pairings 
and tensor products. The material presented in this section is a excerpt of the results 
found in the books [133, 137, 176, 195, 211]. As always, although we refer the 
reader to the books cited above for proofs of the various statements, care has been 
taken to present the material in a structured way so that no results should appear 
surprising. 

The second section (Sect. 3.2) concerns the theory of ^-algebras and thus forms 
the foundation of the algebraic approach to quantum field theory to be discussed 
in the later chapters. Here we will discuss the general features of *-algebras and 
(7 -algebras, states with an emphasize on the Gel’fand-Naimark-Segal reconstruction 
theorem, and the Weyl C*-algebra. For this section we refer the reader to the books 
[99, 130], Details on the Weyl algebra can be found in [24, 151, 158, 200], 

Functional derivatives have always played an important role in physics but on 
infinite dimensional spaces, which appear naturally in quantum field theory, they 
are very subtle. We introduce two different notions of functional derivatives in the 
third section (Sect. 3.3): the directional, or Gateaux, derivative and the Frechet 
derivative. We will show that the two derivatives are closely related. Proofs and more 
information on the directional derivative can be found in [116, 163], 

In the fourth section (Sect. 3.4) we will pick apart the Banach fixed-point theorem 
and prove several statements on the existence and uniqueness of fixed-points and 
their properties. These results will form the basis of the proof existence of solutions 
to the semiclassical Eisntein equation to be presented in Chap. 8. Some of the results 
presented in the fourth section are already shown in [3] by Pinamonti and the author. 

The theory of distributions plays a fundamental role in quantum field theory: 
quantum fields can be understood as distributions. Therefore, we will discuss in 
detail distributions and microlocal analysis in the fifth section (Sect. 3.5). That is, 
we will review the basic definition of distributions and distributional sections on 
manifolds, the nuclearity property of the associated function spaces and the Schwartz 
kernel theorem, the Fourier transform and Schwartz functions and distributions, 
the wavefront set of distribution and distributional sections, including its behaviour 
under various operations such as pullbacks, and finally the important propagation of 
singularities theorem. Good references for this section are the books by Hormander 
[125-128] and also [207, 211], Several recent results on the properties of spaces of 
distributions may be found in [54]. An excellent introduction to the wavefront set 
with several examples is [41] by the same author. 
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In the last section of this chapter (Sect. 3.6) we discuss wave equations. Since 
bosonic quantum field typically satisfy a wave equation and the Dirac-type equations 
satisfied by fermionic quantum fields are closely related, an understanding of the 
solutions of these equations is very important. With some minor modifications we 
mostly follow [23, 24] to introduce the advanced, retarded and causal propagators of 
normally and pre-normally hyperbolic differential operators and their relation to the 
Cauchy problem. Several extensions of the results in [23, 24], that are also partially 
stated here, can be found in [160, 220], 


3.1 Topological vector spaces 

3.1.1 Topology 

A topological space is a set X of points with a notion of neighbourhoods. More 
concretely, besides X it consists of a collection z of subsets, the open sets, such that 

(a) both 0 and X are open, 

(b) the union of any collection of open sets is open, 

(c) any finite intersection of open sets is open. 

We call the collection z the topology of X; examples are illustrated in Fig. 3.1. 

There are two topologies that can be defined for every set. The discrete topology 
of a set contains all its subsets, whereas the trivial topology consists only of the empty 
set and the set itself. 

Given two topologies z and z' on the same set, we can compare them: If t c z', 
we say that z' is finer than z and that z is coarser than z'. It follows that for any set 
the trivial topology and the discrete topology are respectively the coarsest and finest 
possible topology. 

The complements of the open sets are the closed sets. By the de Morgan laws, 
they have the following properties: both 0 and X are close, the intersection of any 
collection of closed sets is closed and any finite union of closed sets is closed. It is 
possible for a set to be both closed and open or neither. If the only sets in X that are 
both open and closed are 0 and X, then X is connected. 

The closure cl U of a set U c X is the intersection of all closed set that contain U. 
The subset U is called dense inX if its closure isX: cl U = X. 

A neighbourhood of a point x e X is an open set U e z that contains x. If 
for each pair of distinct points rj in a topological space X there exist disjoint 
neighbourhoods U and V of x and y, then it is called a Hausdorff space (Fig. 3.1). 
Obviously, endowing a set with the discrete topology turns it into a Hausdorff space. 

A basis (or base) of a topological space X is a collection 53 of open sets in X 
such that every open set inX can be written as the union of elements of 53; one 
can say that the topology ofX is generated by 53. If the basis ofX is countable, we 
say that X is second-countable. A local basis 53(x) of a point r el is defined as a 
collection of neighbourhoods of x such that every neighbourhood of x is a superset 
of an element of the local basis. The union of all local bases is a basis. If every point 
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(a) (b) (c) 

Figure 3.1. Collections of subsets of two points: (a) is a topology, (b) is not a topology 
(the empty set and the whole set are missing), (c) is the discrete topology (thus a 
Hausdorff topology). 

has a countable basis, we say thatX is first-countable. Clearly, a second-countable 
space is first-countable but the implication cannot be reversed. 

We say that a map / : X —* Y between two topological spaces X, Y is continuous 
if for all x eX and all neighbourhoods V of /(x) there is a neighbourhood U of x 
such that /([/) c V. The space of all continuous maps between X and Y is denoted 
C(X, Y) or C°(X,Y) and by C(X) = C°(X) if Y = M. A bijective map between two 
topological spaces X,Y is a homeomorphism is both / and/ -1 are continuous. X and 
y are then called homeomorphic, i. e ., they are topologically equivalent. 

LetX be a set and /, i e / a family of topological spaces with topologies Given 
maps fi : X —* Yj, the initial topology on X is the coarsest topology such that the 
/i are continuous. It is generated by the finite intersections of {/©([/) | U e t,}. 
Examples of the initial topology are the subspace topology, i.e., the topology induced 
on a subspace X c Y by the inclusion map i : X » Y, and the product topology, i.e., 
the topology induced on a product space X = Oiez by the projections n i : X —* Y ; . 
Conversely, given maps / ; : Yj —*X, the final topology onX is the finest topology such 
that each f t is continuous. It is given as 

t = {I/cX | fr\U) e Tj Vi e I}. 

Important examples are the quotient topology on a quotient space X = Y/ ~ with the 
map given by the canonical projection Y —* Y / ~ and the direct sum topology on the 
direct sum X = X;g/ given by the canonical injections Y ; —»X. 

Another application for the initial topology is the topology induced by a pseudo¬ 
metric: A pseudometric on a set X is a map ^ d(- ,■): X x X ^> M such that for all 
x,y,z eX: 

(a) d(x,x) = 0, 

(b) d(x,y) = d(y,x) (symmetry), 

(c) d(x,z) < d(x,y) + d(y,x) (triangle inequality); 

the set X together with d is a pseudometric space. If the pseudometric satisfies 
d(x, y) > 0 (positivity) for all x^fy, then it is a metric and X is a metric space. The 
pseudometric on X induces the initial topology on X which is generated by the open 
balls around each point: 

®(y) = {* ex I d(x,y) < r}. 

A map / : X —» Y between two pseudometric spaces with pseudometrics d x , d Y is 
called an isometry if d x (x,y ) = d y (/(x),/(y)) for all x,y eX. 
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Given a topological space X, a sequence of points (x n ) inX converges to a point x e 
X if every neighbourhood U of x contains all but finitely many elements of the 
sequence. IfX is a pseudometric space, it is called complete if every Cauchy sequence 
with respect to its pseudometric converges to a point in X. Every pseudometric 
space X that is not complete can be completed. Namely, the completion of X is the 
(essentially unique) complete pseudometric space X of which X is a dense isometric 
subspace. 

A topological (sub)space X is called compact if each of its open covers, viz., a 
collection of subsets whose union contains X as a subset, has a finite subcollection 
that also contains X. lfX is compact, then every subset Y c X is also compact in the 
subspace topology. The space X is locally compact if every point in X has a compact 
neighbourhood. Furthermore, X is called cr-compact if it is the union of countably 
many compact subsets. Any compact space is locally compact and cr-compact; the 
converse is, however, false. One can also show that every second-countable and 
locally compact Hausdorff space (thus, in particular, every topological manifold) is 
cr-compact. We say that a map / : X —» Y between two topological spaces X, Y is 
proper if the preimage of every compact set in Y is compact in X. 

A related concept is that of boundedness. A subset U c X of a pseudometric 
space X is bounded if for each x,y € U there exists a r such that d(x,y) < r. 

3.1.2 Locally convex topological vector spaces 

A topological vector space is a K-vector space X such that the vector space operations 
of addition and scalar multiplication are (jointly) continuous with respect to the 
topology ofX. Since addition is continuous and the topology therefore translation- 
invariant, the topology of A is completely determined by the local basis 53(0) at the 
origin. For the same reason one can show that X is Hausdorff if and only if {0} is 
closed. Note that in a Hausdorff topological vector space every complete subset is 
also closed. The (topological) vector space X is called convex if x,y eX implies 
Ax + (1 — A )y eX for all A e [0,1], 

LetX, Y be topological vector spaces and W c X a subset. A mapping f :W —>Y 
is uniformly continuous if to every neighbourhood V of zero in Y there exists a zero 
neighbourhood U c X such that for all x, y e W 

x-y eL7 => /(x)-/(y)eV. 

Every uniformly continuous map is already continuous but the converse is not true. 
However, if / is linear and also W is a vector subspace, then continuity also implies 
uniform continuity. Moreover, if W is a dense subset, then to every uniformly 
continuous map / : W — * Y there exists a unique continuous map / :X —» Y which 
extends f. The extension of a linear map from a dense vector subspace is even 
uniformly continuous and linear. 

It is usually desirable to have topological vector spaces with additional structures: 
A seminorm on a K-vector space X is a map || • || : X —» R such that for all xjeX 
and A e K: 

(a) ||x|| > 0 (positive-semidefiniteness), 
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(b) 11Ax11 = |A| ||jc|| (absolute homogeneity), 

(c) ||x +y|| < ||x|| + ||y|| (triangle inequality). 


If the seminorm satisfies ||x|| > 0 (positivity) for all x/0, then it is a norm and the 
vector space is called normed space. Note that each (semi)norm induces a (translation- 
invariant) (pseudo)metric d(x,y) = ||x — y||. 

Analogously to the pseudometrics, given a family of seminorms {|| • || t } ieI on a 
vector space X, they induce the initial topology on X. More explicitly, the topology is 
generated by all finite intersections of {x el | 11x11, < r}, the open balls around the 
origin. 1 If 7 is countable, we can assume that I c N and the topology above is the 
same topology as the one induced by the metric 


d(x,y) 


Z 


i ll*-yllfc 
2 k l + Ik-yllfc’ 


where the factors 2~ k may be replaced by the coefficients of any convergent series. 

We say that a family of seminorms {|| • || ; } on a vector space X is separating if for 
every nonzero xeX there exists an i such that ||x||; >0. It is immediate that a vector 
space with topology induced by a family of seminorms is Hausdorff if and only if the 
seminorms are separating. 

LetX be vector space endowed with a family of seminorms {|| • Then we 

can define the following topological vector spaces in order of generality: 


1. if the family of seminorms is separating, then X is a locally convex (topological 
vector) space; 

2. if, in addition, I is countable and X is complete with respect to each of its 
seminorms, then X is a Frechet space; 

3. if, in addition, the family of seminorms consists of only one norm, thenX is a 
Banach space. 


Finally note that on all these spaces above the Hahn-Banach theorem can be 
applied. That is, given a K-vector space X with a seminorm || • || and a linear form 
/:[/—» K on a vector subspace U such that |/(x)[ < ||x|| for all x e U, there 
exists a (generally non-unique) linear form / : X —* K which extends / such that 
l/wi<iixii. 


3.1.3 Topologies on function spaces 

Important vector spaces are subspaces of the space of functions 

Y x = {f :X ^Y} 

between a setX and a topological space Y. Y x can be equipped with the topology 
of pointwise convergence, which is just the product topology with the projections 

1 Conversely, given a basis of the origin it is possible to construct a family of seminorms from its 
elements. 
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n x : Y x —* Y,f >-» fix). In this topology a sequence of functions (/ n ) converges to 
some / if and only if each / n (x) converges to /(x) at all x. 

If (T, d) is a pseudometric space, the space of functions can be equipped another 
topology, the topology of uniform convergence. In this case, a subspace Z <ZY X can be 
endowed with the pseudometric 

dif, g) = sup d (/(x), g(x)) 

xex 

for all /, g € Z, which induces a topology for Z. Note, however, that Z with this 
topology is not really a topological vector space because multiplication will fail 
to be continuous unless Z is a subset of the bounded functions. A sequence of 
functions (/„) converges to some / if and only if for every e > 0 there exists a N such 
that d (/„,/) < e for all n>N. 

If X is a topological space, yet another topology on subspaces Z c Y x is the 
compact-open topology. For all compact K c X and open U c Y it is generated by the 
finite intersections of 

{/ e Z | /(fQ C [/}, 

i.e., the set of functions that carry compact subsets into open subsets. The compact- 
open topology is finer than the topology of pointwise convergence. If ( [Y,d) is a 
pseudometric space, the compact-open topology is the initial topology induced by the 
pseudometric on compact subsets, i.e., it is generated by the finite intersections of 

{/ e Z | sup xeK d (/(x), 0) < r}. 

Therefore it is also called the topology of uniform convergence in compacta and, if A 
is compact, it is the same as the topology of uniform convergence. It follows that a 
sequence of functions (/„) converges to some / if and only if for every e > 0 and 
compact K <ZX there exists a N such that d(/ n (x),/(x)) < e for all n > IV and x e K. 

Again, suppose that X, Y are topological vector spaces and Z CY X . Z is called 
equicontinuous if for every neighbourhood of the origin U c X there exists a neigh¬ 
bourhood of the origin V cl such that /( U) c V for every / e Z. Equicontinuity for 
a set of one element is of course the same as continuity. 

3.1.4 Duality 

A duality or dual pairing {Y,X) is a triple ( X, Y, (•,•)) of two vector spaces X, Y and a 
non-degenerate bilinear form (•,•): Y x X —* K, i.e., 

{y, x) = 0 for all y e Y implies x = 0 
(y, x) = 0 for all x e X implies y = 0. 

The standard example of a duality is that between a vector space X and its algebraic 
dualX*, where the pairing is given by the canonical bilinear form 

{• ,•) :X* xX —*K, (/, x) ~ (/, x) = f(x). 
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A more important example is that of the topological dual X' c X* of a topological 
space X, which consists of all continuous linear maps. Note that the pairing ( X',X ) is 
not a proper duality unless X is Hausdorff because the restriction of the canonical 
bilinear form to X' x X is non-degenerate if and only if A is Hausdorff. 

For each tel the map x >-» (y, x) gives an injective map of X into Y* and an 
analogous construction embeds Y into A*. In the following, the identification of A 
with a subspace of Y* and Y with a subspace of X* will always be tacitly assumed 
unless otherwise noted. 

In particular, X is a subspace of K 7 and can therefore be equipped with pointwise 
topology. This locally convex Hausdorff topology is called the weak topology on X 
with respect to ( Y,X ); statements for weak topology will often be indicated by the 
adjective “weakly”. The weak topology is the coarsest topology such that x —» (y, x) 
is continuous for all y e Y and one finds Y = X' with respect to the weak topology 
onl. Moreover, if Y is locally convex, the seminorms on Y yield dual seminorms 
on X given by 

IMI i = sup{|(y,x)| | y eT with ||y|| f < 1}, 
which also induce the weak topology. 

The statements above can also be made with the role of X and Y interchanged 
to introduce the weak topology on Y with respect to (Y,X). In particular, if Y =X' 
and (•, •} the canonical bilinear form, then X' with the weak topology is called the 
weak dual. Furthermore, given a subset Y c X* , then the induced pairing between Y 
and X is non-degenerate if and only if Y is weakly dense in X*. Thus any Y is weakly 
complete if and only if Y =X*. 

Another (locally convex Hausdorff) topology onl induced by a duality ( Y,X) 
with a locally convex space Y is the strong topology, which is the topology of uniform 
convergence on the bounded subsets of Y; statements for strong topology will be 
often be indicated by the adjective “strongly”. It is induced by the family of seminorms 

IMIb = sup|(y,x)| 

yeB 

for each bounded set B c Y. Again, we can interchange the role of X and Y and call 
X' endowed with the strong topology induced by the canonical pairing the strong 
dual. 

If we equip X with the strong topology with respect to ( Y,X ), then the map 
x >-» (y,x) will not be continuous for any yeF. The finest topology onl such that 
this map is continuous is called the Mackey topology but it will not concern us here 
any further. 

Finally, note that the dual of a Banach space is always a Banach space, but the 
dual of a Frechet space that is not Banach is never a Frechet space. 

3.1.5 Tensor products on locally convex spaces 

Given two locally convex topological vector spaces X, Y there are many different ways 
to define a family of seminorms for the space X <E> Y. Therefore there is no natural 
topology for X <S> Y if X or Y is infinite-dimensional, whence one speaks of different 
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topological tensor products. The most common topological tensor products are the 
projective and injective tensor product introduced below. 

The projective tensor product topology equips the algebraic tensor product X <g> Y 
with the finest topology such that ®:Xx7->X07 is jointly continuous. That is, 
it is the final topology defined by the projections n x ■ X —>X ®Y, n Y :Y —> X ®Y. 
Equivalently, the topology is induced by the seminorms 




= inf { 2 k^ Xk Hill' Yk h u = Y^k Xk ® Yk } 


for all u e X ® Y and where the infimum runs over all representations of u. The 
resulting locally convex space is usually denoted X Y and its completion X <8> n Y. 

A coarser topology is defined by the injective tensor product topology, it is the 
finest topology such that ® :X x 7 is separately continuous. Let X', Y' be 

the weak duals of X, Y. Note that X <g> Y can be embedded into the space of bilinear 
separately continuous maps X' x Y' —* K, denoted B(X, Yj, with the topology of 
uniform convergence U x V on all equicontinuous sets U c X and V c Y. That is, 
the topology is generated for all U x V and open I c K by the finite intersections of 


i/£B(X,7)|/([/xV)cl}. 


X <g> Y can now be endowed with the corresponding subspace topology. Seminorms 
that induce this topology are given by 

IMIij = sup{|(/ ® g)(u)| | f <=X',g&Y' such that ||/||,- = ||g|| ; - = 1}. 

The space X <8> Y equipped with the injective topology is usually denoted X ® e Y and 
its completion is denoted X ® e Y. Observe that C{X, T) ~ C(X) T if Y is complete. 

Locally convex spaces on which the injective and projective tensor product agree 
are called nuclear. More precisely, we say that a locally convex space X is nuclear if 

X 0 £ Y =X Y or, equivalently, X® E Y=X® Ji y 

for every locally convex space 2 Y in which case we simply write X ® Y. If both X and 
Y are nuclear, then also X ® Y is nuclear. Moreover, if a subspace of a nuclear space 
is nuclear and the quotient space of a nuclear space by a closed subspace is nuclear. 

A more useful characterisation of nuclear spaces is in terms of summable se¬ 
quences. Denote by l x (X) the X-valued summable sequences, i.e., the set of sequences 
(x n ) in A such that all unordered partial sums Xms/ci i x n converge in A. Further, 
denote by t\{X] the X-valued absolutely summable sequences, i.e., the set of sequences 
( x n ) inX such that < oo for all seminorms || • || ; ofX. ThenX is nuclear if 

and only if 

t l {X) = t l {X} 

and hence, by the above observation, both sides are equal to i = t\® e X = i\® n X. 

In other words, X is nuclear if and only if every summable sequence in X is already 
absolutely summable. Nuclear spaces are therefore very similar to finite-dimensional 


2 Actually it is sufficient to check equality for Y = 1 15 see below. 
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spaces and, while every finite-dimensional locally convex space is nuclear, no infinite¬ 
dimensional normed space is. 

We finish this section by stating what can be called the abstract kernel theorem for 
Frechet spaces: 

(X0Y) / ~X / 0Y / and (X® Y) 7 ® Y 7 

for every X,Y such that X (or Y) is nuclear and where all duals are strong duals. 


3.2 Topological ^-algebras 

A topological *-algebra is a topological algebra si over C, i.e., a topological C-vector 
space with a separately continuous ring multiplication, together with a continuous 
involution *. That is, there is an automorphism 

* : si —» si, x <-> x*, 


which is antilinear and involutive such that 


(a) (ax + by)* = ax* + by*, 

(b) (xy)* = .y*x*, 

(c) (x*)* = X 


for all x,y e si and a, b e C. If, in addition, si has a multiplicative unit 1, we say that 
si is a unital *-algebra. Elements x, y of the algebra si are called 


adjoint 

self-adjoint 

normal 

unitary 


if x* = y, 
if x* = x, 
if x*x = xx*, 
if x*x = 1 = xx*, 


where unitarity obviously requires the existence of a unit element. Note that 1 is 
always self-adjoint. 

A *-subalgebra 3 c si is called a left (right) * -ideal if yx (resp. xy) is in 3 for all 
y e 3 and x e si. If the subalgebra is both a left and right *-ideal, it is just called a 
(two-sided) *-ideal. It follows that an ideal 3 of si is a *-ideal if and only if 3* = 3. 

The homomorphisms that arise between *-algebras, called *-homomorphisms, are 
those that preserve in addition to the multiplicative also the involutive structure, 
i.e., a map a : si —» 88 is a *-homomorphisms if it is an algebra-homomorphism 
and a(x*) = a(x)* for all x e si. If the *-algebras are unital, we also demand that 
*-homomorphisms be unit-preserving. 

Often one needs a *-algebra which also has the structure of a normed vector space. 
In the case of *-algebras, it makes sense to require the norm to satisfy an additional 
property: A norm || • || : si —* R is said to be a C*-norm if 

llx*x|| = ||x|| 2 
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for all x e si. This differs from some definitions of C*-norms because, in fact, every 
C*-norm is automatically a ^-isomorphism and submultiplicative, i.e., 

I|x*|| = ||x|| and ||xy|| < ||x||||y|| 


for all x, y e si [199]. 

If a *-algebra si comes equipped with such a C*-norm || • || that turns si into a 
Banach space, then it is called a C*-algebra. In a C*-algebra, ring multiplication 
and inversion are continuous operations with respect to the norm; the continuity 
of addition, scalar multiplication and involution are obvious. The condition for a 
unital C*-algebra to have a C*-norm imposes such strong conditions on its algebraic 
structure that the algebra uniquely determines the norm. Namely, 

||x|| 2 = ||x*x|| = sup {|A| | x*x — A1 is not invertible} 

for every x e si. A *-homomorphism a : si — * 28 between two unital C*-algebras is 
thus always norm-decreasing: ||a(x)|| < ||x||. 

3.2.1 States 

Given a *-algebra si, one can consider its algebraic dual, the space of linear functionals 
on si. A linear functional co e si* on a unital *-algebra si is positive if it satisfies 

a>(x*x) > 0 

for all xesf. If, si is unital and o>(l) = 1, we say that co is normalized. A functional 
co that is both positive and normalized is called a state. If the *-algebra si comes 
equipped with a topology, we always assume that co is continuous, i.e., we consider 
the topological dual si' instead of the algebraic one; an algebraic state on a C*-algebra 
is automatically continuous with respect to the C*-norm. 

Given positive co e si', it follows that for all xjed 

m(x*y) = <x>(y*x), 

\co(x*y)\ 2 < a>(x*x)a>(y*y), 

where the second line is called the Cauchy-Schwarz inequality. If si is unital, the first 
equation implies that every positive co is hermitian: <x>(x*) = a>(x). 

A state co is pure if every other state rj that is majorized by it, co(x*x) > i 7 (x*x), 
is of the form rj = Xco with A e [0,1]. Consequently, a pure state cannot be written 
as the convex sum of two other states. States that are not pure are called mixed. 

The positive linear functionals equips si with a degenerate inner product via the 
antilinear pairing (x,y) = co(x*y) which can be turned into a pre-Hilbert space by 
taking the quotient by the degenerate elements. This is the essential content of the 
famous Gelfand-Naimark-Segal construction, usually abbreviated as GNS construction, 
which we will state after the following definition. 

A *-representation n of a *-algebra si is a *-homomorphism into the C*-algebra of 
linear operators on a common dense (with respect to the norm || • || = (•, •) 1 ^ 2 on the 
Hilbert space) domain 26 of a Hilbert space 76. Note that the ^-representation n is 
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continuous with respect to the uniform operator topology, i.e., the topology induced 
by the operator norm 


II^Hop = sup {||Tx|| | x e 93 with ||jc|| < l}. 

Moreover, if the domain 93 is complete in the graph topology induced by the family of 
seminorms || • || x = ||tt(x) • ||, we say the the ^-representation n is closed. 

If there exists a vector f2 e such that 7t(s$)f2 = {7t(x),f2 | x e s#} is dense 
in TL, then the * -representation is called cyclic and fl cylic vector. If 7t (sl)f2 is even 
dense in 93 in the graph topology, then n and 12 are called strongly cyclic. The state 
a>(x) = (tt(x)12, 12) defined by a cyclic vector 12 of a cyclic *-representation n is pure 
if and only if the only subspaces left invariant by n(A) are the trivial ones. 

Theorem 3.1 (GNS construction). Let cobea state 3 on a unital topological *-algebra si. 
Then there exists a closed (weakly continuous) strongly cyclic *-representation n of sf on 
a Hilbert space lf€ with inner product (• , •) and strongly cyclic vector 12 such that 

co(x*y) = (tt(x)12, n (y)12) 

for all x, y e si. The representation n is unique up to unitary equivalence. 

This is a standard theorem and a proof can be found in many places, e.g., in [180], 
Working with general ^-algebras, we have not excluded the case of ^representations 
onto unbounded operators. For that reason it is not possible to uniquely extend 
the representation to the whole Hilbert space, and hence self-adjoint elements of 
the algebra might not be represented by self-adjoint operators but only symmetric 
operators. These problems do not occur if one applies the GNS construction to 
C*-algebras. 

3.2.2 Weyl algebra 

Set V to be a R-vector space and cr : V x V —» R an antisymmetric bilinear form (i.e., 
a pre-symplectic form). 4 A Weyl *-algebra W for (V, cr) is a unital involutive algebra 
generated by (nonzero) Weyl generators W , i.e., symbols W(-) labelled by the vectors 
in V, which satisfy, for all v, w e V, the relations 

(a) W(v)W(w) = exp (^cr(v, w)) W(v + w), 

(b) W(vf = W(-v). 

Therefore the Weyl generators also have the following properties: 

(c) W(0) = 1, 

(d) W(v)* = W(—v) = W(v)“\ 

3 Actually it is enough for to to be a positive; the normalization is not necessary for the theorem to 
hold. 

4 We follow [151] and will not assume that cr is non-degenerate. In fact it is sufficient to assume 
that cr is linear in its first or second argument. 
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(e) (W(v)) veV are linearly independent. 

Since TUT is generated by unitaries, every ^-representation is necessarily by bounded 
operators. Moreover, between two Weyl *-algebras generated by Weyl generators W 
and W' for (V, cr) there exists a unique *-isomorphism a completely determined by 
ao W = W'. 

One can endow W with a C*-norm, the minimal regular norm 
||x|| = sup{co(x*x) 1/2 | co is a state on W} 

for all x e W. If the bilinear form cr is non-degenerate, one can show that all C*-norm 
over W are equal. We call the completion W of a Weyl *-algebra W, with respect 
to the minimal regular norm, the Weyl C*-algebra. It is unique up to ^isomorphism 
and, in particular, simple, viz., it has no non-trivial closed *-ideals, if and only if cr is 
non-degenerate [151]. 

The map K B A —* W(Av) is not continuous in °l/lf, because ||W(v) — W(w)|| = 
2 for all distinct v, w e V as a consequence of the spectral radius formula. A 
^representation n of °W on a Hilbert space 76 is called regular if the one-parameter 
unitary groups 

A >-» (tt o W)(Av), veV, 

are strongly continuous. If the ^representation induced by a state on °lf is regular, 
we also call the state regular. Invoking Stone’s theorem (tt ° W)(Av), we can find a 
family of self-adjoint operators F(-) on 76, labelled by vectors in V, such that 

(tto W)(Av) = exp (iAF(v)); 

the map F is called the field operator and is generally unbounded. 

A strongly regular state [23] is a regular state for which the operators F(v), veV, 
have a common dense domain 2) c 76, which is closed under the action of F, and for 
which v —> F(v)w is continuous for fixed w e V. For strongly regular states the field 
operator is linear in its argument and thus a self-adjoint operator-valued distribution. 

3.3 Derivatives 

Various different notions of derivatives on topological vector spaces exist in the 
literature, see [20] for a survey and history of the topic. On infinite-dimensional 
spaces these derivatives are inequivalent and care must be taken to make precise 
which derivative is meant. On Banach spaces there exists the notable example of the 
Frechet derivative. However, many spaces of interest in physics are not normed and 
so one must work with derivatives on more general spaces. Below we will define a 
directional derivative in the sense of Gateaux and later compare it with the Frechet 
derivative on Banach spaces. 

3.3.1 The directional derivative 

Let A, Y be two topological vector spaces and U cl open. The (directional) derivative 
of a function f : U — > 7 at r e [/ in the direction ft eX is defined as the map 
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dfz/W = d/(x;h) = lim-(/(x + eh)'-/(x)) = — /(x + <?h) 
£—>o e d£ 


(3.1) 


E =0 


if the limit exists. In particular, if / is a continuous linear function, then its derivative 
is d/(x;h) = /(h). 

Note that the nomenclature here follows that of [116, 163] and differs from that 
in [3], where Pinamonti and the author called the same derivative Gateaux derivative. 
The reason for this choice of a more neutral name is that the name “Gateaux derivative” 
has sometimes been used for slightly different derivatives. However, all definitions 
known to the author agree whenever the derivative is both linear and continuous in 
the direction of the derivative. 

It should be clear from the definition of the directional derivative, that the ordinary 
and partial derivative are special cases of the directional derivative for functions from 
Euclidean K n to K or C. Consequently the directional derivative is also closely related 
to the local form of the covariant derivative given by a connection on a vector bundle. 

The function / is called differentiable at x if the limit exists for all h e X and 
simply differentiable if it is differentiable at every x e U. Moreover, / is continuously 
differentiable or C 1 on U if the map df is continuous (in the induced topology 
on 1/ xl). Higher derivatives may be defined recursively by 


d, •••d 


hJM = d K...,h n fM = !™ 0 e (x + eK) ~ d L..A-/«) 


and we say that / is C n if df is C n 1 ; if / is C n for all n e N, then we say that / is 
C 00 or smooth. 

Continuity of the derivative already implies many other properties if the involved 
vector spaces are locally convex. Hence, let X, Y be locally convex spaces and 
/ :X D U —* Y be continuously differentiable. Then it can be shown that [116, 163]: 


(a) the fundamental theorem of calculus 


r 1 


/(x + h)-/(x) = 


d/(x + th;h)dt 


Jo 

holds if x + [0,1 ]h c U, 

(b) / is locally constant if and only if df = 0, 

(c) the map h <-* d/(x; h) is linear, 

(d) / is continuous (not necessarily true iiX, Y are not locally convex!), 

(e) if / e C n , the map (h l3 ..., h n ) >-» d n /(x; h lt . ..,h n ) is symmetric and multilin¬ 
ear and we can use yet another notation 


(d n /(x), h]_®---®h n )= d n f{x\ hi,..., hj, 
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(f) if / e C n+1 , Taylor’s formula 


/O + h) = /(x) + d/(x; fr) + • • • + —d n /(x; h,...,h) 

n! 



* 1 

(1 — t) n d n+1 /(x + th;h,... ,h) 

Jo 


holds for x + [0,1 ]h c U. 


Moreover, given locally convexX, Y, Z, open subsets U cl, V c Y and C n maps 
/ : U —> V, g :V —» Z, the chain rule holds for the composition go/, i.e., also the 
composition g of is C n [116, 163], 

In ordinary calculus one can show that a continuously differentiable function 
function is locally Lipschitz. An analogous result holds for the directional derivative 
on normed spaces (see also [3]): 

Proposition 3.2. Let f : X —» Y be a continuously differentiable map between the two 
normed spaces (X, || • || x ) and (Y, || • || y ). Then f is locally Lipschitz, i.e., for every convex 
neighbourhood U of x 0 eX there exists a K >0 such that for all x 1; x 2 e U 


ll/Oi)-/(>2)llw < K ||xi — x 2 ||v- 


Proof. Since the derivative d/(x; h) is continuous and linear in h eX, there exists a 
convex neighbourhood U of x 0 such that 

l|d/(x;h)||y < ||d/(x)|| op ||h|| x <K\\h\\ x 


for all x e U. As Lipschitz constant K we can choose the supremum of x —» ||d/(x)|| op 
in U. By the fundamental theorem of calculus we have for x x ,x 2 e U 


/Oi)-/(>2) = 


" 1 

d/ (^2 + t (xi - x 2 ); x : - x 2 ) dt. 


Jo 


Hence, taking the norm on both sides, the previous equation yields 


ll/Oi)-/(>2)lly < 


- 1 

||d/ (x 2 + t(x 1 - x 2 );x x - x 2 ) |[ y d t <K ||x x - x 2 || x . 
Jo 


□ 


Later on we will often encounter spaces of differentiable and smooth functions 
and thus need an appropriate topology on these space: Let X, Y be a topological 
vector space such that Y is locally convex and U c X open. We can equip the vector 
space C n (JJ, Y) of all n-times continuously differentiable maps between X and Y with 
the seminorms 

\\f\\i,k,K = sup||d fc /0)ll;,o P 
xeK 


for all / e C n (I7, Y), every compact K c U and 0 < k < n. These seminorms 
induce an initial topology on C n (JJ , Y) turning it into a locally convex space. This is 
another example of a compact-open topology or topology of uniform convergence on 
compacta. Note that, if Y is a Frechet space and U is cr-compact, C n (JJ,Y ) becomes 
a Frechet space. 
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3.3.2 The Frechet derivative 

On Banach spaces it is possible to define another directional derivative, the Frechet 
derivative. Given Banach spaces X, Y and an open subset U c X, a map f :U —*Y 
is called Frechet differentiable at x e U if there exists a bounded linear operator 
D/(x) : X —» Y, the Frechet derivative of / at x, such that 

lim ||h||^ 1 (/(x + h)*=/(x)-D/(x)h) = 0. (3.2) 

Plh—*o 

The operator D/(x) is unique if it exists. In analogy to the directional derivatives that 
we encountered so far, we also write D h /(x) = D/(x; h) = D/(x)/i. We call / Frechet 
differentiable if the Frechet derivative exists for all x e U. If the Frechet derivative is 
continuous in x, then / is continuously Frechet differentiable. 

The Frechet derivative is closely related to the directional derivative defined above 
(see also [3]): 

Proposition 3.3. Let X, Y be Banach spaces, U c X open and f : U —> Y a map. f is 
Frechet differentiable if and only if f is continuously differentiable. In that case the two 
derivatives agree. 

Proof. We can bring (3.2) into agreement with (3.1) by replacing h in (3.2) 
by th, tel, and take the limit || th\\ v —» 0 along the ray of h, i.e., by taking t to zero 
while keeping h fixed. Moreover, D/(x) is clearly continuous because it is linear and 
bounded. 

As in proposition 3.2, since the derivative d/(-) is a continuous linear 
map, there exists a (convex) neighbourhood V of x where it is bounded. Using the 
fundamental theorem of calculus again, we obtain for any jeV and sufficiently 
small hex 

11/(x + h) —/(x) — d/(y;h)|| y < sup ||d/(x + th) - d/(y)|| op ||h|| x . 

te[0,i] 

In particular this holds for x = y and thus / is Frechet differentiable at x with 
D/(x) = d/(x). □ 

It follows that any statement on continuously differentiable maps also holds for 
Frechet differentiable maps. 

Frechet differentiability is a very strong notion of differentiability and many 
theorems from ordinary calculus can be generalized to the Frechet derivative but not 
further to the directional derivative on arbitrary Frechet spaces. An example is the 
inverse function theorem for which holds for the Frechet derivative on Banach spaces 
but does not hold on general Frechet spaces. On some Frechet spaces one has instead 
the Nash-Moser theorem [116]. 


3.4 Fixed-point theorems 

Let us start this section by stating the most elementary fixed-point theorem, the 
Banach fixed-point theorem: 
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Theorem 3.4 (Banach fixed-point theorem). Let f : X —»X be a contraction on a (non¬ 
empty) complete metric space X. Then f has a unique fixed-point x = /(x). Furthermore, 
taking an arbitrary initial value x 0 el, x is the limit of the sequence (x n ) defined by 
the iterative procedure x n+1 =/(x„). 

We will not prove this theorem here; the proof is not difficult and can be found in 
essentially any introductory book on (functional) analysis. Instead we will dissect, 
specialize, generalize and finally prove various parts of this theorem separately. 

3.4.1 Existence and uniqueness 

Let us start with a useful lemma: 

Lemma 3.5. Let ■••cVjC V k _ 1 c • • • c V Q be a decreasing sequence of sets. Suppose 
there exists afunctional f such that f : V k —»\4 +1 for every non-negative k < n. Any 
fixed-point x = /(x) in V 0 is already in V n . 

Proof. Suppose that xel/ 0 but x ^ V n is a fixed-point. Then there exists ak < n such 
that x € V k and x f V k+1 . Since x is a fixed-point of /, we have that x = /(x), but 
/(x) e V k+1 by the properties of/. □ 

This lemma has serval useful consequences. One example is the following: The 
limit of a convergent sequence in a complete metric space is not necessarily as 
regular as all the elements of the sequence; a priori the regularity of the limit is 
only controlled by the topology induced by the metric. However, if the limit is the 
fixed-point of a smoothing map, the situation is much better. 

Corollary 3.6. LetX,Y be a topological vector spaces and U cl open. Further, let V k c 
C k (U ,Y) for all k such that V k c V k _ 1 . Suppose there exists a smoothing functional f 
such that f : V k —* V k+1 for every non-negative k < n. Any fixed-point x = /(x) in C° 
is already in C n . □ 

If one is interested only in existence of fixed-points but not their uniqueness, then 
one can perform a straightforward generalization of the ‘existence’ part of Banach’s 
fixed-point theorem: 

Proposition 3.7. Let (X, d) be a non-empty complete metric space and f : X —* X a map. 
Assume that there exists a subset U c X such that f : U —» U and f is a contraction 
on U with Lipschitz constant K e [0,1), i.e., for all y,z e [/ 

(z)) < Kd(y,z ). 

Then there exists a fixed-point x = F(x) in X. 

Proof. Define for an arbitrary x 0 e U the Picard sequence (x n ) where x n+1 =/(x„). 
Using the contractivity of / on U, we get 

d ( x n+l> x n ) < Kd(x n ,x n _ i) < K n d(x 1 ,x o). 

One can then easily show that (x n ) is a Cauchy sequence and take the limit n —» oo in 
x n+1 =/(x n ) to see that there exists a limit x =/(x) inX. □ 
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This proposition does not guarantee uniqueness of the fixed-point because the 
mapping is only required to be a contraction on a subset of a complete metric 
space and the fixed-point is not necessarily contained in this subset. Nevertheless, 
uniqueness holds if the mapping is of the form assumed in Lem. 3.5. Moreover, if the 
mapping is smoothing as in Cor. 3.6 then the unique fixed-point is even C n . 

Proposition 3.8. Let (X , d) be a non-empty complete metric space and (VjJ be a decreas¬ 
ing sequence of sets as in Lem. 3.5 such that V 0 czX is closed. Suppose that f : V k —»14+t 
for every non-negative k < n such that f is a contraction on V n . Then f has a unique 
fixed-point x = /(x) e V n . 

Proof. The existence of fixed-points in V 0 that are contained in V n follows from 
Lem. 3.5 and Prop. 3.7. Assume now that there exist two distinct fixed-points x,y. 
Since / is a contraction on V n , we have 

d(x,y) = d(/(x),/(y)) <Kd(x,y), 

where K e [0,1) is the Lipschitz constant of/, and thus arrive at a contradiction. □ 


3.4.2 A Lipschitz continuity criterion 

Next we will see that it is not necessary for a map to be a contraction for it to have 
fixed points. In fact it is sufficient for the map to satisfy a certain Lipschitz continuity 
condition: 

Lemma 3.9. Let ( X,d ) be a non-empty complete metric space. Suppose there exists 
K € K + such that f : X —*X satisfies 

d(/V),/ n (j))<^-d(x,y) 
for all xjel and n e N. Then f has a unique fixed-point. 

Proof. Since nl grows faster than K'\ there exists a N such that f n is a contraction 
for all n> N. If we set V k = f k (X), we can apply Prop. 3.8 and the thesis follows. □ 


The special bound assumed in Lem. 3.9 is in fact very natural if / is the intgral 
functional 


/ : C[a,b ] 


r t 


C[a,b], /O)(t) = / 0 (t) + 


k(x)(s) ds, 


(3.3) 


where / 0 e C[a, b ] and with integral kernel k : C[a,b] ^ C[a, b]. Recall that space 
of continuous functions C[a, b] in the interval [a, b] can be turned into a Banach 
space by equipping it with with the uniform norm 


imic[a,b] = imioo = sup |X(t)|, 
te[a,b] 

where we will use |pf || C [ a instead of the more common HXH^ to emphasize the 
interval over which the supremum is taken. 
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Proposition 3.10. Let f be of the form (3.3) with k continuously differentiable in 
U c C[a, b] open such that f closes on a closed subset V c U, i.e., /(V)cV. Then f 
has a unique fixed-point in V. 


Proof. We can show the statement using Lem. 3.9 and an inductive procedure. 
Applying Prop. 3.2, we find that k is locally Lipschitz as a functional on U; denote 
by L = sup xe[/ ||d/c|| op its Lipschitz constant. Using the uniform norm on C[a, t], we 
thus obtain 


r t 


ll/M-/(y)llc[a ;t] 


< 


II k(x) - fc(y)|lc[a,t] d5 < - a)||x - y|| C [ a ,b]- 


Suppose now that 


L n (t — a) n 

\\f n W -f n iy)\\ c[a ,ti < n , ll*-yllc[a,b]- 


holds up to n and for arbitrary t G [a, b]. Then, 


r t 


|/ n+1 (x)(t)-/ n+1 (y)(t)l 


n +It 


< 


\\{kof n \x)-{kof n \y)\\ c[as] ds 


r t 


< L 
V 


< 


n\ 


ll/ n W-/ n (y)|lc[a jS ]d5 


(s-a) n ||x-y|| c[ab] ds 


L n +\t-a) n+1 

~ (n + 1)! ^ _ ^llc[a.b]> 

which implies that (3.4) holds also for n + 1, thus concluding the proof. 


(3.4) 


□ 


3.4.3 Closed functionals 

The last proposition contains an apparently minor but in fact very strong condition, 
namely that the functional k closes within the set V. In any application of a fixed- 
point theorem similar to Banach’s theorem, the crucial point to check is usually not 
that the map is a contraction but that it is closed. In the given case of an integral 
functional (3.3), however, we can always be assured that there exists an interval 
I c [a, b] on which the functional closes [3]; this interval might be very small. 

Proposition 3.11. Suppose that k is bounded on a set U c C[a, b] which also includes 
a ball V around f 0 defined as V = {x | ||x — /ollc[a,b] < 5} for some 5. Then there exists 
t e (a, b] such that f satisfies /"(L/) ["[ Qjt ] C t/ T[ a ,t]- 

Proof. Since k is bounded on U, it clearly satisfies 

l|fc(*)llc[a,t] ^ H fc O)llc[a,b] <K = SUp ||k(y)|| C [a,b] 

yeU 
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for all x e 17. Then, taking the norm of (3.3) after subtracting / 0 , one obtains 

ll/O) - fo\\c[a,t] < (t - a)PO)llc[a,t] < (t - a)K, 

because V c 17. For any 5 we can always find a t such that (t — a)K < 5 and therefore 
f (17) ( [a,t] c ^ f[a,t] • The thesis follows because V C 17. □ 


3.5 Microlocal analysis 

3.5.1 Distributions 

We will now define three important function spaces and their topological duals, which 
will be called spaces of distributions. 

To conform with standard notation we denote the space of smooth functions on an 
open subset 17 c by 

«([/) = C°°([7, C). 

As observed in Sect. 3.3.1, it is a Frechet space with the compact-open topology. The 
elements of the topological dual ^'(U) are called compactly supported distributions. 

The vector space of rapidly decreasing (or decaying ) functions will be denoted by 
<S(17). We say that a smooth function / e < g(17 n ) is rapidly decreasing (decaying) if 

ll/lli,n,m = sup (1 + |xr)||d m /(x)|i; < OO 

xeU n 

for all i,n,m. We equip <S(17) with the topology induced by these seminorms and 
see that it is a Frechet space. The topological dual §'(U) is the space of tempered 
distributions or Schwartz distributions. 

Another subspace of %(U) is the space of test functions, denoted by 

95(17) A C“(17,<C). 

We can equip this space with a topology similar but more complicated than that 
of ig(t7). If K c 17 is compact, we can endow 95(16) = ”8(16) with the subspace 
topology. Then, taking a compact exhaustion c K 2 c • • •, (J ; K t = 17, the topology 
on 9)(17) is the initial topology defined by the projections 7t ; : 9)(fC ; ) —» 9>(t7). This 
topology is not Frechet unless 17 is compact and 'S(U) is already Frechet, in which 
case 95(17) = ^(17). The topological dual 9s 7 (17) is the space of distributions. 

More generally, we define ^(17, X), with & = 95,'S or S, as the spaces of functions 
with values in a locally convex vector space X and by SF 7 ( U,X ) the associated distribu¬ 
tion spaces. The necessary generalizations to the definitions above are straightforward 
but note that the resulting function spaces are not Frechet unless V is already Frechet. 
Moreover, it is possible to define T-valued distributions ^'(U,X,Y), where Y is a 
locally convex space. 

Given a distribution u e 9) / (t7), we can restrict it to a distribution u v on any open 
1/ c 17 by setting 


u v (f) = u(/) 


(3.5) 
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for every / e 25(\7). A distribution is uniquely determined by its restrictions: If 
(Ui)i gN is an open cover of U and tq G such that u t = u ; - whenever U i n Uj ^ 0, 

then there exists a unique u e 9b'(U) such that iz ; is the restriction of u to U t for 
every i. 

The support suppu of a distribution u e is the smallest closed set V c U 

such that the restriction of u to U \ V vanishes. More precisely, 

suppu = U \ |^J{V C U open | iz(/) = 0 V/ e 2>(1/)}. 

It follows that iz(/) = 0 if suppu n supp/ = 0 and that the space < g / (Ll) is indeed the 
space of compactly supported distributions. 

3.5.2 Distributions on manifolds 

The discussion above does not yet encompass the case of distributions on smooth 
manifolds because manifolds are not vector spaces. However, manifolds are locally 
homeomorphic to a vector space - Euclidean space. 

Let M be a smooth manifold, £->Ma smooth vector bundle and ([/ ; ) ieN an open 
cover of M such that (£/j, <p ( ) are coordinate charts and (L/jji/q) local trivializations. 
We define again the space of smooth sections 

< g(M,£) = r 0O (£) 

as the space of functions / : M — * E such that t/q ° / ° pf 1 is smooth for each i, i.e., 
we require 

Vh °f ° ^r 1 

A locally convex topology that turns %{M, E) into a Frechet spaces is the initial 
topology induced by the product topology on the right-hand side of the injection 

n 

ieN 

the topology is independent of the choice of the cover (Uf) j gN . The topological dual 
of is the space of compactly supported distributional sections < g / (M,E). 

The space of compactly supported smooth sections, the test sections, is denoted 

25 (M,E) = r 0 °°(£). 

Analogously to the vector space case we define an initial topology on 25(M, E) induced 
by that on E), where (Kf form a compact exhaustion of M; whence the space 
of test sections becomes a Frechet space. The space of distributional sections 2/( M, E) 
is the topological dual of 25(M, £). 

The restriction of distribution generalizes to distributions on manifolds in the 
obvious way: Given a manifold M and an open subset U c M, every distribution 
u e 2/(M, E) can be restricted to a distribution u v e 2) / ([7, E v ) by setting 


Ujjif ) = u(/) 
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for all / e 2)([/, E^). Also on a manifold a distribution is completely supported by its 
restrictions. 

It does not make sense to define a notion of rapidly decaying sections or tempered 
distributional sections on manifolds. It is also clear, that the concept of distributions 
can be further extended to objects such as Frechet manifolds in very much the same 
way as above for smooth manifolds. 


3.5.3 Nuclearity and the Schwartz kernel theorem 

All the function spaces but none of the distribution spaces defined in the previous two 
sections are Frechet. However, all the function and distribution spaces (with either 
the weak or strong topology) are nuclear. For this reason, we have the isomorphisms 

2>([/,X) ~ a([7) ®X, 3/([/,K,X) ~ 9>'([7) §X, 

^ «([/) ®X, «'([/,K,X) ~ «'([/) ®X, 

S(R m ,X) ~ «(R m ) ®X, S'(R m , K,X) ^ S'(R m ) ® X 


for a complete locally convex topological vector space X and an open set U c R m . As 
another consequence we can specialize the abstract kernel theorem ( cf. Sect. 3.1.5) 
to these function spaces under which circumstances it is called the Schwartz kernel 
theorem. One finds the following isomorphism (open subsets U c R m and V c R n ): 

«'([/ x V) ~ «'([/) ® «'(V) ^ L («([/), «'(V)), 
a'fu x y) ~ 2/([/)§9/(y) ~ L(9)([/),2) / (y)), 
s / ( R m+n ) ^ §/(R m ) § S^R") ~ L (jg(R m l ^'(R")), 


where L(X, 7) denotes the space of continuous linear maps between topological vector 
spaces X and Y with the topology of uniform convergence. Analogous isomorphisms 
(at least for % and 9)) exist for both sets of isomorphisms also for functions and 
distributions on manifolds. 

As a consequence of these isomorphisms, there exists for every distribution 
K e 9/([/ x y) a unique linear operator K : 9)([/) —» 2/(y) and, conversely, to every 
linear operator K a unique distribution. Let / e 9>([/) and g e 9)(y) be test functions. 
Formally we can write 


r 

K(J ® g) = 


K(x,y)f(x)g(x)d m xd n y 


JUxV 


for the distribution with distributional kernel X(x,y) and 


(K/)(y) = 


K(x,y)f(x)d m x, ( c Kg)(x) 


r 

K(x,y)g(y)d n y, 

Jv 


for the associated operator and its transpose. 

The operator K is called semiregular if it continuously 5 maps SS(L7) into “S(y) and, 
analogously, the transpose f K is called semiregular if it continuously maps 28(y) into 


Continuity is meant with respect to the usual topology of %( [V ) and not the subspace topology of 
2>'(V). 
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“§({7). In the case that C K is semiregular, we can uniquely extend K to an operator 
acting on compactly supported distributions "SfU) by duality: 

(Ku)(g) = ( f Kg)(u) 


for all u e %{U) and g e S/V). 

If both K and C K are semiregular, we say that K regular. Moreover, it is called 
properly supported if the projections from supp K c U x V onto each factor are proper 
maps. A properly supported operator K maps 2>([/) to < g / (V) and can therefore 
be extended to an operator 2>(I/) —* 'g/V). Since linear differential operators are 
properly supported and regular, they can be uniquely extended to distributions and 
they can also be composed. 


3.5.4 Fourier transformation and convolution 

Let us denote by L p (M n ) the L p spaces of functions on M n with values in C. That is, 
L p fS n ) is the space of functions for which the Lebesgue integrals 


i /p 


|/(x)| p d n x 


exist and where we identify functions which are equal almost everywhere so that the 
L p spaces become Banach spaces. L 1 functions are called Lebesgue integrable, while 
L 2 are called square-integrable. 

On the space of Lebesgue integrable functions L 1 (M n ), the Fourier transform is 
defined as the automorphism 


& : fix) ~ Hfm = (2U r n /(x)e-^ d"x, 

JR" 

where • denotes the Euclidean dot product. The Fourier transform satisfies 

SF 2 (/)(x) = (2ti)"/(-x) 


and the inverse Fourier transform is therefore given by 


SF'VXx) 


fif)e ix 'S d n f. 

J R" 


When no confusion can arise, we usually denote the Fourier transform of a function / 
by / instead of SF(/). 

By the Riemann-Lebesgue lemma, it is clear that /(/) —» 0 as \q\ —* oo. In fact, 
the Fourier transform is a linear isomorphism from the subspace of rapidly decaying 
functions <S(M n ) into itself. Since the space of rapidly decreasing functions is stable 
under differentiation and multiplication by polynomials, one finds for / e 6/K' ! ) 


3XVX?) = €/*/(?) and Hx l jm = d t JU-l 


(3.6) 
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Moreover, given also g e S(R n ), the Plancherel-Parseval identities are 


/(x)g(x)d n x = (2<T n 


|/(x)| 2 d"x = (27t)- n 


R" 


/(?)g(C)d n ?, 

i 

l/(?)| 2 d n ?. 


(3.7) 


As a consequence, the Fourier transform can be extended to an isomorphism of L 2 (R n ) 
into itself. 

The Plancherel-Parseval formula (3.7) guides us to extend the Fourier transfor¬ 
mation SF further to the space of tempered distributions S / (R n ) by 


{u,f) = {u,f) 


for all u e S'QSJ 1 ) and / e S(R n ), i.e., it is the transpose of the Fourier transformation 
on rapidly decreasing functions. It follows that ^ is a linear isomorphism from ^(R") 
(with the weak topology) into itself and analogues of the relations (3.6) hold also for 
tempered distributions u e R n ): 

^(^u)(£) = £„u(€) and SF(x^u)(£) =-3 M u(£). 

If we restrict to the space of compactly supported distributions, the Fourier 
transform of u e < g / (R ri ) is equivalently given as the smooth function 

u(?) = (u,/:x-e- i ^). 


The Fourier transform u can be directly extended to C n as an entire analytic function. 
The convolution of two Lebesgue integrable functions /,je L 1 (R n ) is defined as 


- 

(/ * g)M = 


fiyMx-y)d n y. 


JR" 


The product thus defined is dual to the usual product with respect to Fourier transfor¬ 
mation. To wit, the identities 

Hf*g)=fg and SF(/ g) = (2'rt)“ n (/ * g) 


hold and are the result of the convolution theorem. Note that for distributions u e 
§'( R n ) and v e ig^R"), the convolution u * v is a well-defined tempered distribution 
and its Fourier transform satisfies ^(u * v) = u v as in the convolution theorem. If 
also the product uv is well-defined as a (tempered) distribution, cf. Sect. 3.5.7), then 
other statement of the convolution theorem holds and SF(uv) = (2u )~ n (u * v) 


3.5.5 Singularities and the wavefront set 

Every locally Lebesgue integrable function u e L^R") can be identified with a 
distribution in 2> / (R ri ), denoted by the same symbol, via 

r 


u(/) = (u,f) = 


u(x)/(x)d n x 
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for all / € < 2)(R n ). We say that a distribution u e 2> / (R n ) is smooth, if it is induced 
from a smooth function via this duality pairing. More specifically, every smooth 
function corresponds to a distribution in this way and in fact 2>(K n ) is isomorphic 
to a dense subset of 2/(R n ). Therefore this pairing uniquely extends to a pairing 
between 2)(R ri ) and 3/(R n ). Using the Plancherel-Parseval identity (3.7), it can be 
written explicitly as 


u(/) = ( 2tt)" n 


Xu(0f(-0 d n f, 


(3.8) 


where u e SS^R”) and % e 2£(®0 such that % = 1 on a compact neighbourhood of 
the support of / e 2)(R n ). This pairing may be considered the motivation of the 
wavefront set to be defined below. 

The singular support singsuppiz of a distribution u e 9/(R") is then defined as 
the complement of the union of all open sets on which u is smooth in the sense of 
the pairing above. In other words, it is the smallest closed subset licR” such that 
UR»\u e w n \ u). 

The Fourier transform, introduced in the previous section, can be used to give 
a condition on the smoothness of a compactly supported distribution u e ig / (R n ). 
Namely, u is smooth if and only if for each n € N 0 there exists a constant C n such that 

iu(?)i<c n a + i?ir n 


for all (£R”. 

Checking this condition for certain f, a regular direction of a compactly supported 
distribution u € ^(R") is a vector (GR" \ {0} such that there exists an open conical 6 
neighbourhood r of £ and such that 

sup(i + Kimoi < °o 

?er 

for all n e N 0 . Conversely, a E, is called a singular direction of u if it is not a regular 
direction. The (closed) set of all singular directions of u is 

27(u)A{ ?e R"\{0}|£ is not a regular direction of u}, 

i.e., the complement of all regular directions. 

We can localize the notion of singular directions and say that f is a singular 
direction of u € at x e U, where U c R n is open, if there exists a n e N 0 such 

that 

sup(i + i<rim Z uxoi 

?er 

is not bounded for all % e 2)(L7) localized at x (i.e., %(x) 0). That is, the set of 

singular directions at x is the closed set 

S x (u) = f]E( X u), 
x 


6 A cone in R" is a subset fcl" such that AF = r for all A > 0. 
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where the intersection is over all % e 2)([/) such that ^ 0. 

This leads to the definition of the wavefront set as the set of the singular directions 
at all points: 

WF(u) = {(x; £) e U x (R n \ {0}) | £ e 27 x (u)}. 

Thus the wavefront set is a refinement of the notion of singular support. Moreover, it 
can be used as a practical tool for calculating the singular support because singsupp u 
is the projection of WF(iz) onto the first component. The wavefront set has the 
following properties: 

(a) WF(/u) c WF(iz), 

(b) WF(iz + v) C WF(iz) U WF(v), 

(c) WF(Piz) C WF(iz) 

for all distributions u, v e 2/(L7), localizing functions y e 9)(f7) and linear differential 
operators P (with smooth coefficients). 


3.5.6 Wavefront set in cones 

Let U c R n be open and r c U x (R n \ {0}) a closed cone, where we have extended 
the definition of a cone to sets for which the projection to the second component at 
each point is a cone. We define distributions with wavefront set contained in the 
cone r as 

2^(L0 = {u e 9J'([7) | WF(u) c r}, 

which is not empty for any cone f. The normal topology 7 turns 9S^([/) into a complete 
nuclear space [54]. It is induced by the seminorms 

IMIb = sup|u(/)| and ||u||„ yiX = sup (1 + \E,\Y\&(x u XZ)\, 

f&B E,eV 

for all bounded sets B c 9)(L/), n e N 0 , localizing functions % G 9)(L7) and closed 
cones y CR”\ {0} such that supp(^) xVcT 

Given a closed cone r as above, define the open cone 

71 = (r'y = {(x; OGi/x(R"\{0}) | (x; -£) ^ r} 

as the complement of the reflection of r and 

^(M n ) = {v£ | WF(v) c A} 


as the space of compactly supported distributions with wavefront set contained in A. 
Then one can find an analogue to the pairing (3.8) for all u e <3>' r (K n ) and v e ^(R n ) 
given by [54] 


(u,v) = (20 n 


^u(Ov(-0 d"?. 


J K" 


7 The normal topology [54] is finer than the often emplyed Hormander topology for these spaces. 
Nuclearity also holds for the Hormander topology but not completeness. 
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for any % e 2S(R) such that ^ = 1 on a compact neighbourhood of suppv. < g^(R' 1 ) 
thus becomes the topological dual of (with the normal topology) and 

(equipped with the strong topology) it is also nuclear but not complete unless A is 
also closed [54]. 

3.5.7 Pullback of distributions 

Let U, V be open subsets of R n and i : U —* V a diffeomorphism. The pullback i*u of 
a distribution u e 2/(V) is (uniquely) defined for every / e 2£([/) as the transpose of 
the pushforward (up to the Jacobian determinant) 

(i*u,f) = (u, t j |detdi|) 

or, equivalently, as the continuous extension of the pullback on smooth function. 
Consequently, for any closed cone r c V x (R n \ {0}), one obtains 

= % r (U), L*r = { (x; T*i(0) [ 0(x); £) e rj (3.9) 

and hence WF(i*u) = CWF(u). 

Trying to generalize this result to cases where t : U —* V is not a diffeomorphism 
but an embedding of an open subset of R n into an open subset of R m can fail if there 
are (x; £) such that T* t(^) = 0. It follows that a distribution u e ^/(V) can only be 
pulled back to a distribution l*u if WF(u) n N = 0, where 

N = { (t(x); ^)eyxM m |iei/, T* t(?) = 0} 

is the set of conormals of i. 

Given two distributions u e 2/( U ) and v e Qs'fV), where iicR" and V c R m 
are open, the tensor product 


u <g> v : f <8) h —» u(/)v(h) 

is a distribution in 3/(1/ xV)~ 2/([/) ® 2/(V) via Schwartz’s kernel theorem. One 
can show that its wavefront set satisfies 

WF(iz ®v)c (WF(u) x WF(v)) U ((suppux{0})x WF(v)) 

U (WF(iz) x (suppv x {0})). 

It is possible to pullback the tensor product u <g> v of two distributions over the 
same space (i.e., U = V) with the diagonal map 

A : U x U —» U, (x, x) >-» x 

if WF(iz O v) n N a = 0, where N A is the set of conormals with respect to the map A, 
which gives the (unique) product izv of the two distributions. This requirement of the 
wavefront set implies that it is possible to multiply two distributions if and only if 


(x.f)eWF(u) => (x,—S)^WF(v) 


(3.10) 
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and then wavefront set of the product is bounded by 

WF(izv) c {O; £ + Q | (x; e WF(u), (x; Q e WF(v)} 1 

U WF(u) U WF(v). 

Note that for u, v that do not satisfy (3.10), the singular directions would add up to 
zero in the first term on the right-hand side of (3.11). 

3.5.8 Wavefront set of distributional sections 

The wavefront set can be extended to distributions on vector-valued functions 
component-wise, i.e., using 2/(17,K m ) ~ 2/(17) ® K m ~ 2/([/)® m . Namely, one 
defines for u e 2/(17, K m ) 

m 

WF(u) = [^J WF(Uj), 

i =1 

where u t e 2/(17) are the components of u. This definition is invariant under a 
change of basis because such a change only implies a multiplication of (u,-) by a 
matrix with smooth components. 

Moreover, the wavefront set being a local concept, it generalizes to manifolds 
and distributional sections in a coordinate neighbourhood via local trivializations. 
However, to be meaningful, it needs to transform covariantly under diffeomorphisms. 

Let (t/;); 6 ^ be an open cover of a smooth n-manifold M such that (17 ; , ip ; ) are 
coordinate charts and (17;, t/q) are local trivializations of the vector bundle E —» M. 
Given a distribution u e 2/(M,£) with restrictions u ; to 17;, the wavefront set for 
every u ; given by 

WF( Ui ) = { (x; T>(£)) e 17 ; x (M n \ {0}) | (ip(x); ?) e WF(^ ; o u t o c^r 1 ) }. 

and transforms as a conical subset of the cotangent bundles T*17; as seen by (3.9). In 
particular, WF(u ; ) n T*(17; n 17j) = WF(u ; ) n T*(17; n 17j) for all i, j. 

The wavefront set of distributional sections u is then defined as the union of all 
WF(iz ; ). In other words, it is the set of points 

(x; £) e t*M = T*M \ {(y; 0) e T*M}, 

the cotangent bundle with the zero section removed, such that (x; ff) e WF(i%), 
where 17 is a coordinate and trivialization neighbourhood of E. 

3.5.9 Some distributions and their wavefront set 

For any / e 1§(M), Dirac’s 5-distribution is 

5 (/)=/( 0 ) 

and it follows that 5 has support only at the origin. There it does not decay in any 
direction because 5 = 1 so that WF(5) = {0} x (R \ {0}). Consequently powers of the 
5-distribution cannot be defined. 
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The Dirac 5-distribution can be decomposed into two distributions 


<5±(/)= lim 

£—>0 + 


/OO 

x ± ie 


dx, 


for all / e <S(K), such that —2tti5 = 5 + + 5_, where WF(5 ± ) = {0} x K ± . Now, 
powers of either 5 ± are well-defined but the distribution 5 + 5_ does not exist. 

The wavefront set of 5 + (and analogously that of 5_) can be calculated as follows: 
Using the residue theorem, the Fourier transform ofl/(x + is) for e > 0 is 8 


" e -i*? 

-dx = —2tti@(^) e~^ £ . 

J R * + ie 

Taking the limit e —* 0 + , this gives 5 + (^) = —2tti0(£). Then, applying the convolu¬ 
tion theorem, one obtains the Fourier transform of y5 + for all / e 26(K) as 


r? 


&(X5+) = — (l*<5+) = -i 

2tt 


/(fc)dfc. 


Since this decays rapidly as £ —» — oo and does not decay as E, —* oo, we get the 
expected wavefront set. 

Related to the diagonal map A : (x, x) >-» x, we can define for all / € 9)(R 2 ) a 
diagonal distribution 




/(x,x)dx. 


Jm. 


It is clear that the wavefront set of A is 


WF(A) = N a = {(x, x; l, -£) e M 4 \ {0}}. 


Given instead two functions / 1; / 2 e 2>(M), we can write 


^(/i ® / 2 ) = 


(/i*<5)(x)/ 2 (x)dx 


Splitting the 5-distribution into its positive and negative frequency components as 
above, we can therefore define 


At(/) = I™ 

£^0+ 


/Ocy) 


y — x ± ie 

which in the case / = /i ® / 2 can be written as 


dx dy, 


AtCfi ®/ 2 ) = 


(A *<5±)(x)/ 2 (x)dx. 


It is not a difficult exercise to show that [5, Exmpl. 1.4] 

WF (At) = {(x, x; l,-O e M 4 \ {0} I ± £ > 0}. 


Moreover, using the Plancherel-Parseval identities it is possible to show that A± is 
well-defined for all / 1; / 2 e L 2 (M). 


8 0 denotes the Heaviside step-function. 
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3.5.10 Propagation of singularities 

In Sect. 3.5.5 we already noticed that WF(Pu) c WF(u). That is, knowing the 
wavefront set of the distribution u, we can deduce information about the wavefront 
set of Pu, where P is a differential operator. The theorem on the propagation of 
singularities gives us information in the opposite direction. Namely, WF(Pu) and the 
form of P, tell us a lot about WF(tz). 

Let P : ig(M, E) —* %{M,E) be a differential operator acting on sections of a vector 
bundle E —» M. Its characteristic set is the cone 

charP = {(x; Ef) e T*M [ detcr P (x, Ef) = 0} 

on which the principal symbol cr P of P cannot be inverted. 9 An integral curve of cr P 
in charP is called a bicharacteristic strip, its projection onto M a bicharacteristic. 

Theorem 3.12 (Propagation of singularities). Suppose that P is a differential operator 
with real homogeneous principal symbol such that no complete bicharacterstic stays in 
a compact set of M (i.e., P is of real principal type) and let u,f£ 2/(M, E) such that 
P u = f. Then 

WF(u) c charPUWF(/) 

and, if (x; Ef) e WF(u) \ WF (/), it follows that (x'; Ef) e WF(tz) for all (x 7 ; Ef) on the 
bicharacteristic strip passing through (x; Ef). 


3.6 Wave equations 

Both classical and quantum fields usually satisfy an equation of motion given by a 
wave equation 

P u = f, (3.12) 

where P is a normally hyperbolic differential operator, u is the field and / and an 
external source. On globally hyperbolic manifolds the wave equation can be solved, 
i.e., the Cauchy problem for (3.12) is well-posed. 

3.6.1 Retarded and advanced propagators 

Let (M,g) be a spacetime and P : %{M,E) —* %{M,E) a differential operator on 
sections of a vector bundle E —* M. A linear operator G v : 9s(M,E) —» %{M,E) such 
that for all / € 2)(M, E) 


PG v /=/ and G V P/=/, 

i.e., G v is a left- and right-inverse of P, and 

supp(G v /) c J + (supp/) 

9 If the principal symbol of P is invertible, we say that P is elliptic. Example of elliptic operators are 
the Laplace operator and the Cauchy-Riemann operator. 
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is called a retarded propagator or retarded Green’s operator for P. Similarly, a linear 
operator G A , which is a two-sided inverse of P and satisfies 

supp(G A /) c J“(supp/) 

for all test sections /, is called a advanced propagator or advanced Green’s opera¬ 
tor. 10 We say that P is Green-hyperbolic if it admits unique retarded and advanced 
propagators when restricted to a globally hyperbolic region. 

Given a linear differential operator Q such that P o Q = Q o p, i.e., Q commutes 
with P, then it also commutes with the propagators of P. That is, one finds 

G V Q/=QG V / and G A Q/= QG A / 


for all / e«(M,E). 

If P is Green-hyperbolic, then the transpose operator C P on sections of the dual 
bundle E* is also Green-hyperbolic; we denote its propagators by G^ and G] . They 
are closely related to the propagators of P and one finds 

G v = c (G a ) and G a = f (Gy). 

Since the propagators are regular, they can be uniquely extended to operators 
%'{M, E ) —> 2/(M, £). Although the propagators are not properly supported, they can 
also be defined for some non-compactly supported sections. The geometry of (M, g) 
enables us to define further types of ‘compact’ support: We say that a (distributional) 
section u is future or past compact if there exists a Cauchy surface S such that 

suppucJ + (Z') or suppu cj“(27), 

respectively. Denote by the subscripts ‘fc’ and ‘pc’ the subsets of (distributional) 
sections of future and past compact support. Via the transpose propagators Gf,, G t , 
we can then uniquely extend the retarded propagator to 94(M,£) -» 2/(M, E) and 
the advanced propagator to 9>' e ( M,E)-»9/(M,E). 

Let E be endowed with a bundle metric (•, •)• The formal adjoint P* of P with 
respect to (•, •) is given by 


f r 

(P f,h)n g = (/, P*h) p g 

J M J M 

for all /,h e %{M,E ) such that supp/ n suppfr is compact. If P* = P, the operator is 
called formally self-adjoint. In that case, it follows from the last paragraph that 


(G v /, hj n g 
Jm 


(/,G A h)jU g . 

Jm 


As indicated above, wave operators on globally hyperbolic manifolds play an 
important role and, in fact, they are particularly well-behaved [24, 103]: 


10 Note that our definition of the support of the retarded and advanced propagators is exactly opposite 
to that in [23, 24] and also [72]. 
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Theorem 3.13. Any normally hyperbolic operator P on a globally hyperbolic manifold 
admits unique retarded G v and advanced propagators G A . 


It is not difficult to extend this result to pre-normally hyperbolic operators on 
globally hyperbolic spacetimes. Namely, given pre-normally hyperbolic operators P 
and Q such that P o Q is normally hyperbolic, P possesses unique retarded and 
advanced propagators 


G v — Q ° G v and G A — Q o G A , 

where G v , G A are the propagators for the composite operator P o Q. 


3.6.2 Causal propagator 

The causal propagator is defined as the difference of the retarded and advanced 
propagator 

G = G v — G a . 

From the support properties of the retarded and advanced propagator it is clear that 
supp(G/) = J(supp/) for all / e 9)(M,£). In Sect. 5.3.2 we will see that the causal 
propagator, or rather the associated distribution via Schwartz’s kernel theorem, may 
also be called the commutator distribution or Pauli-Jordan distribution. 

By the regularity of the retarded and advanced propagators, it is clear that G 
extends to an operator —* < 3b'{M,E'). Noting the support property of G, 

this statement can be strengthened to extend the causal propagator to 2>' C (M, E ) —* 
2/(M,E). Here we have denoted by a subscript ‘tc’ the space of (distributional) 
sections of timelike compact support, t.e., the sections u such that 

supp u c J + (Z' 1 ) n J~{1 7 2 ) 
for two Cauchy surfaces S 1 ,S 2 . 

Every smooth and spacelike compact solution of the homogeneous differential 
equation Piz = 0 propagating on a globally hyperbolic spacetime (M, g ) with Green- 
hyperbolic operator P can be obtained by applying G to a test section /. In fact, if we 
denote by % SC (M, £) the smooth sections of E with spacelike compact support, then 
we find the exact sequence 

{0} —> 2>(M, E) -!-» 9>(Af, E) I g sc (M, E) ^ SC (M, £). 

This sequence also entails the fact that the kernel of G is given by P9s(M, £). In other 
words, f — f = Vh for some f,f,h € 2)(M,E) implies that Gf = G f. 

Closely related to the existence of a causal propagator is the question whether 
the Cauchy problem is well-posed. The Cauchy problem for the wave equation Piz = 0 
on a globally hyperbolic manifold (M, g) is the following: 
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Given a Cauchy surface i : 27 —> M with normal vector field n, does there exist a 
unique section u e E ) such that 

Pu = 0, 

< L*U = U 0 , 

L*V n U = U 1 

and the solution u depends continuously on the data u 0 , u 1 e “8(17, t*E ). 

This question can be answered in the positive for normally hyperbolic operators P 
on globally hyperbolic spacetimes. With the appropriate modifications, the Cauchy 
problem can also be formulated for pre-normally hyperbolic operators. Also in that 
case Cauchy problem is well-posed [220]. For general Green-hyperbolic operators 
the Cauchy problem is more complicated and it is not obvious whether the Cauchy 
problem is well-posed. 
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Summary 

In this chapter we discuss the results obtained by Fewster and the author in [95] on 
the enumeration of the run structures of permutations. Some of the results stated 
here will can be applied in the study of the moment problem in quantum field theory 
and the connection will be discussed briefly in Sect. 4.4. 

The first section (Sect. 4.1) gives a summary of the elementary definitions for 
(linear) permutations and circular permutations. Then, the subsections of the second 
section (Sects. 4.2.1 to 4.2.3) deal, respectively, with the enumeration of the run 
structure of atomic, circular and linear permutations. Using a suitable decomposition, 
this is accomplished in each case by reducing the enumeration problem to that for 
atomic permutations. In the third section (Sect. 4.3) we apply and extend the methods 
developed in the preceeding sections to enumerate the valleys of permutations, 
thereby reproducing a result of Kitaev [138]. Finally, in the last section (Sect. 4.4), 
we discuss the original motivation of the work [95] and other possible applications. 


4.1 Permutations 

Let us adopt the following notation for integer intervals: [a .. b] = [a, b] n N = 
{a, a + 1,..., b} with the special case [n] = [1.. n]. 


4.1.1 Linear permutations 

Given a set S, a (linear) permutation of S is a bijection cr : S —* S. In the two-line 
notation of the permutation of a finite set is written as 


cr = 


a b c 
cr(a) cr(b) cr(c) 


where a, 5,c,...eS. It is clear that the order of elements in the first line is irrelevant 
as long as the second line is ordered accordingly. 

The set of all bijection on S forms the (linear) permutation group 6 S of S; the 
group operation is the composition o of functions. There are n! permutations in 6 S 
if S is a set of n elements. In the special case that S = [n], one writes 6 n = ©[„]. 

Given a (strict) total order on a finite set S, i.e., a binary relation < that is 
transitive and trichotomous, the first line will always be ordered in the natural order. 
Since every finite ordered set S is isomorphic to a subset [n] of the natural numbers 
with the standard ordering, this identification will tacitly be assumed henceforth. 
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Therefore the first line can be disposed of and one can use instead the one-line 
notation 

cr = ( cr(l) cr(2) cr(3) ••• ) . 

We see that a permutation is equivalent to a change of the linear order of the set S. 
Further condensing the one-line notation, the permutations of a finite ordered set 
can be identified with words 

cr = cricr 2 cr 3 • • • , 


where the shorthand cr ; = cr(i), i e S, was used. 

4.1.2 Circular permutations 

Instead of considering different orderings of a set along a line, one can study different 
arrangements of the elements of the set on an oriented circle (turning the circle over 
produces in general a different permutation). 

Let S be a set with a distinguished element e. The circular permutations of S are 
the bijections cr : S —> S that preserve e, i.e., cr(e) = e. The circular permutations of S 
also form a group, the circular permutation group £ s ; if S = [n], define £ n = 
Clearly, C s is a subgroup of © s and its cardinality is (n — 1)! if that of S is n. 

A cyclic order is a ternary relation on a set S is a set of triples T c S x3 that 

satisfies 

(a) [a, b,c] e T implies [b,c,a] e T (cyclicity), 

(b) [ a,b,c ] e T implies [c, b,a] ^ T (asymmetry), 

(c) [a, b, c], [a, c, d] e T implies [a, b, d] e T (transitivity), 

(d) a, b, c mutually distinct implies either [a, b, c] e T or [c, b, a] e T (totality). 

Every (strict) total order < induces a cyclic order by setting [a, b, c] e T if and only if 
a < b < c or b < c < a or c < b < a. Conversely, every cyclic order induces different 
possible linear orders. Namely, setting a < b if and only if [a, b,e] for fixed e € S 
yields a total order on S \ {e} which can be extended to a linear order on S by defining 
e as either the minimal or maximal element of the set. Consequently, the natural 
choice for the distinguished element of a finite ordered set S in the construction above 
is the minimal or maximal element of S; we will always choose the minimal element. 

The different notations for linear permutations generalize straightforwardly to 
circular permutations. Given a circular permutation cr of a finite ordered set S, write 

cr = ( 1 cr(2) cr(3) ••• ) . 

To distinguish circular permutations more clearly from linear ones and to highlight 
the circular symmetry, we modify the word-notation in the case of a circular permuta¬ 
tion cr to 

cr = la 2 cr 3 ■ ■ ■ & n , 



4.1. Permutations 85 


in analogy with the notation for repeating decimals when representing rational 
numbers. Moreover, for convenience we define cr(n + 1) = cr(l) for all circular 
permutations cr of n-element sets. 

4.1.3 Atomic permutations 

Let us introduce a special subgroup of & s for a finite ordered set S with minimal 
element e maximal element m. 

Definition 4.1. Define the rising atomic permutations 1 2ft c © s as those permutations 
that satisfy cr(e) = e and a (m) = mfor all a e 2lg. The falling atomic permutations 
cr e 21“ are the reversed rising atomic permutations, i.e., cr(e) = m and cr(m) = e. 

Naturally, the cardinality of 21^ is (n — 2)!. If S = [n], we write 21 + (2l“) and see 
that it is the set of permutations of the form 1 • • • n (n • • • 1). 

Let us discuss the significance of the atomic permutations. We say that a per¬ 
mutation cr e 6 S of S contains an atomic permutation n e 2l r , T c S, if n can be 
considered a subword of cr. The atomic permutation ti in cr is called inextendible if cr 
contains no other atomic permutation n' e 2l r /, T' c S, such that T c t'. 

In particular, any permutation cr e 6 S of S with |S| > 2 contains an inextendible 
atomic permutation n e & T of a subset T c S that contains both the smallest and 
the largest element of S. That is, if S = [n] and we consider cr as a word, it contains 
a subword n of the form 1 • • • n or n ■ ■ ■ 1. The permutation n will be called the 
principal atom of cr. 

Proposition 4.2. Any permutation cre6 s of a finite set S c N can be uniquely decom¬ 
posed into a tuple (n 1 ,..., n k ) of inextendible atomic permutations n l e 2l r ., T t c S 
(non-empty) such that n l ^ T | = n 1 ^ 1 for all i <k and UjT, = S. We call n l the atoms of 
cr. 

Proof. Existence: It is clear that any permutation of a set of 1 or 2 elements is 
an atomic permutation. Suppose, for some n > 3, that all permutations of n — 1 
elements or less can be decomposed into inextendible atomic permutations. Without 
loss of generality, we show that any non-atomic permutation cr e <S n also has a 
decomposition into inextendible atomic permutations. Regarding cr as a word, we 
can write cr = a ■ n ■ co, where a and oj are non-empty subwords. Notice that the 
permutations a ■ n and n ■ co have a unique decomposition by assumption. Since 
an atomic permutation begins or ends with the largest element, we find that a 
decomposition of cr into inextendible atomic permutations is given by the combination 
of the decompositions of a ■ n and n ■ to. 

Uniqueness: This is clear from the definition of inextendibility. □ 

Because of this property, the atomic permutations will prove to be very useful. 

'The rationale for this naming should become clear later when we see that arbitrary permutations 
can be decomposed into atomic permutations, but no further. 
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4.1.4 Mountaineering 

Given a (linear or circular) permutation cr of an ordered set S of cardinality n, a 
position i < n is a descent of cr if cr(i) > cr(i + 1). Any i < n of a permutation cr that 
is not a descent is called an ascent of cr. For example, the permutation 52364178 
has the descents 1,4,5 and the ascents 2,3,6,7, whereas the circular permutation 
1.4536782 has the descents 3,7,8 and the ascents 1,2,4,5,6. 

All the descents of a (linear or circular) permutation cr can be collected in the 
descent set 

D(a ) = {i | i is a descent of cr}. 


It is an elementary exercise in enumerative combinatorics to count the number of 
linear permutations of [n] whose descent set is given by a fixed SC[n-l], Let 
S = {s 1; s 2 ,... ,s k } be an ordered subset of [n — 1], then [37, Thm. 1.4] 


/?(S) = |{cre© n | D(cr) = S}\ = 


~T I 


res 


s 1> 5 2 — 5 1» 5 3 — 5 2> • • • > n — s k 


This result can also be adapated to circular permutations. 

Related to the notions of ascents and descents are the concepts of peaks and 
valleys. A peak occurs at position i e [2.. n — 1] of a linear permutation cr if 
cr(i — 1) < cr(i) > cr(i 4- 1), whereas a valley occurs in the opposite situation 
cr(i — 1) > cr(i) < cr(t + 1). Again, this notion can be generalized to circular 
permutations, where, additionally, 1 is always a valley and n is a peak if and only if 
cr(n) > cr(n — 1). In the example above, 4 is a peak 2,6 are valleys of 52364178, 
whereas 3,7 are peaks and 1 is a valley for 1.4536782, see also Fig. 4.1. 


4.2 Run structures 

Definition 4.3. A run r of a (linear or circular) permutation cr is an interval [i..j] 
such that a(i ) ^ cr(i + 1) ^ ^ cr(j) is a monotone sequence, either increasing or 

decreasing, and so that it cannot be extended in either direction; its length is defined to 
be j — i. If a is a permutation of an n-element set, the collection of the lengths of all 
runs gives a partition p of n — 1 (linear permutations) or n (circular permutations). The 
partition p is called the run structure of cr. 

It follows that a run starts and ends at peaks, valleys or at the outermost elements 
of a permutation. For example, the permutation 52364178 has runs [ 1.. 2], [2.. 4], 
[4..6], [6..8] with lengths 1,2,2,2, whereas the circular permutation 1.45 36782 
has runs [1..3], [3.. 4], [4.. 7], [7. .9], of lengths 2,1,3,2. Representing these runs 
by their image under the permutation, they are more transparently written as 5 2, 
236, 641, 178 and 145, 53, 3678, 821 respectively. The runs of permutations can 
also be neatly represented as directed graphs as shown in Fig. 4.1. In these graphs 
the peaks and valleys correspond to double sinks and double sources. 

Motivated by a problem in mathematical physics [90] (see also Sect. 4.4), we are 
interested in the following issue, which we have not found discussed in the literature. 
By definition, the run structure associates each permutation a e £ n with a partition p 
of n. For example, 1.4536782 and 1.3452786 both correspond to the same partition 
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®^(2>^(D^(6)K4MI>^(ZMD 


Figure 4.1 . The two directed graphs representing the runs of the linear permutation 
52364178 (left) and the circular permutation 1.4536782 (right). Peaks and valleys 
are indicated by boldface numbers. 

1+24-2 + 3 of 8. Our interest is in the inverse problem: given a partition p of n, 
we ask for the number Z e (p) of circular permutations whose run structure is given 
by p. One may consider similar questions for other classes of permutations, with 
slight changes; for example, note that the run structure of a permutation cr e 6 n is a 
partition of n — 1. 

To put the research in [95] in perspective with the existing literature on the enu- 
merative cominatorics of permutations, a short remark is in order: The enumeration 
of permutations according to their run structure was already discussed by Andre [16] 
for alternating permutations, i.e., permutations that alternate between ascents and 
descents. In [42] the enumeration of linear permutations according to the order and 
length of their runs was studied, so obtaining a map to compositions, rather than 
partitions. In contrast to this approach, the method discussed in [95] was designed 
to facilitate computation; for the application in [90] calculations were taken up to 
65 runs using exact integer arithmetic in Maple™ [152], 

4.2.1 Atomic permutations 

We now begin the enumeration of atomic permutations according to their run struc¬ 
ture. That is, for every partition p of n — 1 we aim to find the number Z a (p) of atomic 
permutations 2l± of length n. 

Observe that any cr e 21+ can be extended to a permutation in 21+ +1 by replacing 
n with n + 1 and reinserting n in any position after the first and before the last. 
Thus, 13425 can be extended to 153426, 135426, 134526 or 134256. Every 
permutation in 2l+ +1 arises in this way, as can be seen by reversing the procedure. 
The effect on the run lengths can be described as follows. 

Case 1: The length of one of the runs can be increased by one by inserting n either 
at 

1. the end of an increasing run if it does not end in n + 1, thereby increasing its 

length (e.g., 13425 134526) 

-0-0—o-o- o-o-0^00 

2. the penultimate position of an increasing run, thereby increasing its own length 
if it ends in n + 1 (e.g., 13425 —» 135426) or increasing the length of the 
following decreasing run otherwise (e.g., 13425 —» 13425 6) 
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■o^o—0-0 

■0-0-0-0+i; 


■OOO-0-0 

0 - 0 —ooo 


Case 2: Any run of length i + j > 2 becomes three run of lengths 1, i and j if we 
insert n either after 

1. i elements of an increasing run (e.g., 13425 —* 153426 exemplifies i = 1, 
J = D 

■OO-^^-O-O' -> <K>"++ ! 00' 

2. i + 1 elements of a decreasing run (e.g., 14325 —» 143526 for i = 1, j = 1) 

••CHO^+P-O- -► O O 1 O -©• O O 


An analogous argument can be made for the falling atomic permutations 21“. 

Notice that every partition of a positive integer n can be represented by monomials 
in the ring of polynomials 2 Z[x 1 ,x 2 , ■ ■ .,x n ]. Namely, we can express a partition 

p = p 1 + p 2 H-1- p k as x pi x p2 ■ ■ ■ x pk (for example, the partition l+2 + 2 + 3of8 

is written as x 1 x 2 x 3 ). 

Now, let p be a partition andX the corresponding monomial. To this permutation 
there correspond Z a (p) permutations in 2l~ which can be extended to permutations in 
2l^ +1 in the manner described above. Introducing the (formally defined) differential 
operator 


^ 3 ^ 3 

9> = 9) 0 + 9> + with = > x i+ i—, 9)+= } x^iXj- -, (4.1) 

h dx i at t dx >+j 

we can describe this extension in terms of the action of 9) on X. We say that 9> 0 is 
the degree-preserving part of 9>; it represents the case 1 of increasing the length of a 
run: the differentiation 3/3x ; removes one of the runs of length i and replaces it by a 
run of length i + 1, keeping account of the number of ways in which this can be done. 
Similarly, case 2 of splitting a run into 3 parts is represented by the degree-increasing 
part 9> + . For example, each of the 7 atomic permutations corresponding to the 
partition 1 + 1+3 can be extended as 

9)x 2 x 3 = 2xiX 2 x 3 + x 2 x 4 + x^x 2 , 


i.e., each can be extended to two atomic permutations corresponding to the partitions 
1 + 2 + 3, one corresponding to 1 + 1 + 4 and one tol + 1 + 1 + 1+2. 

Therefore, starting from the trivial partition 1 of 1, represented as x 3 , we can 
construct a recurrence relation for polynomials A n = A n (x lt x 2 ,... ,x n ) which, at 
every step n > 1, encode the number of atomic permutations Z a (p) of length n + 1 

2 If one wants to encode also the order of the run (e.g., to obtain a map from permutations of 
length n to the compositions of n), one can exchange the polynomial ring with a noncommutative 
ring. Alternatively, if one wants to encode the direction of a run, one could study instead the ring 
yi> x 2 > y 2 >---L where x t denotes an increasing run of length i and y, encodes a decreasing run of 
length j. 
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with run structure given by a partition p of n as the coefficients of the corresponding 
monomial in A n . The polynomial A n , accordingly defined by 

n 

A n =Y t z ^Y\ x f°> (4 - 2) 

pbn i=l 

where the sum is over all partitions p of n and p(i) denotes the multiplicity of t in 
the partition p, can thus be computed from the recurrence relation 

A = x 1 , (4.3a) 

A n = <3>A n _ lt (n > 2). (4.3b) 

We say that the polynomials A n enumerate the run structure of the atomic permuta¬ 
tions. 

We summarize these results in the following proposition: 

Proposition 4.4. The number Z> a (p) of rising or falling atomic permutations of length 
n — 1 corresponding to a given run structure (i.e., a partition p of n), is determined by 
the polynomial A n via (4.2). The polynomials A n satisfy the recurrence relation (4.3). 

Note that atomic permutations always contain an odd number of runs and thus 
Z a (p) is zero for even partitions p. 

It will prove useful to combine all generating functions A n into the formal series 

oo n n oo t n 

am=X a +1 —=£> n A— 

n\ n\ 

n =0 n =0 

which can be expressed compactly as the exponential 

sf(A) = exp(A'2))A 1 . 

The first few A n are given by 


A = *2 
A = *3 + 

A = x 4 + 5x 2 X^ 

A 5 = x 5 + 7 x 3 x\ + llx^x 1 + 

A 6 = x 6 + 9 x 4 x^ + 11^2 + 38x3X2^! + 61 x 2 Xj. 

from which we can read off that there is 1 permutation in 21^ corresponding to the 
trivial partition 5 = 5, 7 corresponding to the partition 5 = 1 + 1 +3, 11 corresponding 
to5 = l + 2 + 2 and 5 corresponding to5 = l + l + l + l + l.Asa check, we note 
that 1 + 7+11 + 5 = 24, which is the total number of elements of 21^; similarly, the 
coefficients in the expression for A 6 sum to 120, the cardinality of 21^ . A direct check 
that the coefficients in A n sum to (n — 1)! for all n will be given in the last paragraph 
of Sect. 4.3. 

The first degree term A^ 1 -* of A n is x n as can be seen by a trivial induction using 
A(i) = ; j, which follows from the recurrence relation (4.3). Therefore Z a (n) = 

1. 
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For A® with k > 1 also the effect of 2> + has to be taken into account, complicating 


things considerably. Nevertheless, the general procedure is clear: once A^ 2) is known 
for all men, A® can be obtained as 


n -1 



m=k-1 


Here one can make use of the following relation. Applying 2> 0 repeatedly to any 
monomial x i} x l2 ■ ■ ■ x ik of degree k yields, as a consequence of the Leibniz rule. 



n 



x h+h x i2+i2--- x i k +h- (4-4) 


Jl> J2’ • • • > Jk 


Jl,j2,~,]k>0 

jl+h^ - kjk=n 


This observation provides the means to determine the third degree term A®. 
Applying 2) + to any = x m with m > 2 produces x 1 x p x q with p + q = m and 


p,q> 1. Moreover, the repeated action of 2> 0 on x 1 x p x q is described by (4.4) and 
thus 





p,q,r,s,t> 0 
l+p+q+r+s+t=n 


After some algebra this yields 

Proposition 4.5. The third degree term A® of the polynomial A n , n>3,is given by 



(4.5) 


i+j+k=n 


The equation (4.5) for the third degree term A® can be rewritten into a formula 
for Z^(pi + p 2 + p 3 ), i.e., the number of permutations of [n + 1] that start with 1, 
end with n + 1 and have three runs of lengths P\,p 2 ,P 3 , by changing the first sum to 
a sum over i,j,k e {pi,p 2 ,p 2 }- In particular, this gives rise to three integer series for 
the special cases 


Z a (r! + n + n), Z a (l + n + n), Z a (l + l + n), 


with n G N. 

The first series 



= 1,11,181,3499,73501,1623467,... (n > 1) 


gives the number of atomic permutations with three runs of equal length n. It does 
not appear to be known in the literature nor can it be found in the OEIS [164] and 
the existence of closed form expression is currently unkown. For the second series, 
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however, a simple closed form can be found: 



= 2 P J - 1 = 11,39,139,503,1847,..., (n > 2) 

is the number of atomic permutations in 21 ^ n+2 with two runs of length n. One may 
understand this directly: there are ( 2n ) permutations in which the length 1 run is 
between the others and ( 2n ) — 1 in which it is either first or last. The third series, 
Z a ( 1 + 1 + n), i.e., the number of atomic permutations in 2l^ +3 with two runs of 
length 1, is given by the odd numbers bigger than 3: 

Z a (l + 1 + n) = 2n + 1 = 5,7,9,11,13,15,..., (n > 2). 

Observe that terms of the form x" in A n encode alternating permutations, which 
were already investigated by Andre in the 1880’s [17]. As a consequence of his 
results, we find that the alternating atomic permutations are enumerated by the 
secant numbers S n , the coefficients of the Maclaurin series of secx = S 0 + S 1 x 2 / 2! + 
S 2 x 4 /4! + ---, 

/■ 2n+l -v 

Zj J] 1 J = S n = 1,1,5,61,1385,50521,... (n > 0, OEIS series A000364). 

This is due to the fact that all alternating atomic permutations of [2n] can be 
understood as the reverse alternating permutations of [2.. 2n — 1] with a prepended 
1 and an appended 2 n. Moreover, since any x 2n+1 can only be produced through an 
application of 28 on x 2 x^ n we also have Z a (2 + Xifii ^ l) = S n . 

4.2.2 Circular permutations 

The methods developed in the last section to enumerate atomic permutations can 
also be applied to find the number of circular permutations Z e ‘(p) with a given 
run structure p. Indeed, any circular permutation in £ n -i can be extended to a 
permutation in (£ n by inserting n at any position after the first (e.g., 1.4532 can be 
extended to 1.64532, 1.46532, 1.45632, 1.45362 or 1.45326). As in the case of 
atomic permutations, this extension either increases the length of a run or splits a 
run into three runs. Namely, we can increase the length of one run by inserting n 
at the end or the penultimate position of an increasing run or we can split a run of 
length i + j >2 into three runs of lengths i, j and 1 by inserting n after i elements of 
an increasing run or after i + 1 elements of a decreasing run. 

We introduce polynomials C n representing the run structures of all elements of 
€ n , by analogy with the polynomials A n in the previous section: 

Q = 2 z c(p)fW !) (4 ‘ 6) 

pbn i=l 

and we say that the polynomials C n enumerate the run structure of the circular 
permutations. In the last paragraph we saw that we can use the differential operator 
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25 introduced in (4.1) to find a recurrence relation similar to (4.3). Namely, 

C 2 = x\, (4.7a) 

C n = aC n _ l5 (n > 3) (4.7b) 


giving in particular 

C 3 = 2 x 2 x 4 

C 4 = 2^2 + 2 x 3 X! + 2x\ 

C 5 = 2xa t x 1 + 6x 3 x 2 + 16X3X3 

C 6 = 2 x 5 x 4 + 8x 4 x 2 + 6X3 + 62X3X3 + 26X3X3 + I6X3 

from which we can read off that there are 2 permutations in <£ s corresponding to 
5 = 44-1,6 corresponding to the partition 5 = 34-2 and 16 corresponding to 
5 = 24-14-14-1. As a check, we note that 6 4-164-2 = 24, which is the total number 
of elements of C 5 ; similarly, the coefficients in the expression for Cg sum to 120, the 
cardinality of C 6 . More on this can be found in the last paragraph of Sect. 4.3. 

In summary, we have a result analogous to Prop. 4.4: 

Proposition 4.6. The number Z e -(p) of circular permutations of length n correspond¬ 
ing to a given run structure p is determined by the polynomial C n via ( 4 . 6 ). The 
polynomials C n satisfy the recurrence relation ( 4 . 7 ). 

Note that circular permutations, exactly opposite to atomic permutations, always 
contain an even number of runs and thus Z e -(p) is zero for odd partitions p. 

The enumeration of circular and atomic permutations is closely related. In fact, 
introducing a generating function '€ as the formal series 

00 x n 00 x n 

W) = Y. C n+ 2 — = y *b n C 2 — = exp(A 25 )C 2 , 

^ n! t=o n ' 

one can show the following: 

Proposition 4.7. The formal power series % is the square of a formal series si; namely, 

'g(A) = sl(A) 2 = (exp(A 9 ))A 1 ) 2 , ( 4 . 8 ) 


where A 1 = Xj. 

Proof. This may be seen in various ways, but the most convenient is to study the 
first-order partial differential equation (in infinitely many variables) 

dTo 

— - = 0, <8(0) = C 2 (4.9) 


satisfied by c 6. 

We can now apply the method of characteristics to this problem. Since it has no 
inhomogeneous part, the p.d.e. (4.9) asserts that ^ is constant along its characteris¬ 
tics. So, given A and x 1; x 2 ,..., let ^i(/i), ... be solutions to the characteristic 
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equations with / r (A) = x r , i.e., X\ (/i), Xi (a 0> • • • are the characteristic curves which 
emanate from the point (A, x la x 2 , ...). Then, 

W)\ x . = <§(0)1^0) = C 2 ( Zl (0)) = Zl (0) 2 . 

Applying the same reasoning again to si, which obeys the same p.d.e. as % but with 
initial condition sf(0) = A l5 

= rf (0)| z .( 0 ) =A 1 (*1(0)) = *i(0). 


Therefore, Prop. 4.7 follows by patching these two equations together. □ 


As a consequence also the polynomials A n and C n are related via 


n—1 


m=l v y 


] A m A n _ m . 


It then follows that the second degree part of C n is given by 


n -1 


cf = S 


m=l 


n — 2 
m- 1 


and, applying (4.5), that the fourth degree part can be written as 


(4.10) 


i+j+k+l=m 

Similar to the atomic permutations, we find that the alternating circular permuta¬ 
tions satisfy ( cf. [16, §41]) 


n—l—q—2 


Z € 



= T 

x n 


1,2,16,272,7936,353792,... (n > 1, OEIS series A000182) 


and also Zg(2 + 3 l) = T n , where T n are the tangent numbers, the coefficients 

of the Maclaurin series of tanx = T 1 x 1 + T 2 x 3 /3! + T 3 x 5 / 5! + ■ ■ ■. Furthermore, 
from (4.10) we find the relation 


= 2 

m =0 

which can be traced back to tan 7 x = sec 2 x. 

To conclude this section, we note that the argument of Prop. 4.7 proves rather 
more: namely, that exp(A2)) defines a ring homomorphism from the polynomial ring 
C[x 1; x 2 ,... ] to the ring of formal power series C[[x 1; x 2 ,...]]. This observation can 
be used to accelerate computations: for example, the fact that A 3 = x 3 + x 3 implies 
that 

s#"(A) = al( A) 3 + exp(A2>)x 3 , 


2 n 
2m 


c S 

'-'m'-’n—m-) 


which reduces computation of A n+3 = 2> n+2 x 1 to the computation of 2> n x 3 . Once s1 
is obtained, we may of course determine ^ by squaring. 
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4.2.3 Linear permutations 

In the last section we studied the run structures of circular permutations C n and 
discovered that their run structures can be enumerated by the polynomials A n . One 
might ask, what the underlying reason for this is. Circular permutations of [n] 
have the same run structure as the linear permutations of the multiset {1,1,2,..., n} 
which begin and end with 1. These permutations can then be split into two atomic 
permutations at the occurrence of their maximal element. For example, the circular 
permutation i.4532 can be split into the two atomic permutations 145 of {1,4,5} 
and 5321 of {1,2,3,5}. This also gives us the basis of a combinatorial argument for 
the fact that % = si 2 . Similarly it is in principle possible to encode the run structures 
of any subset of permutations using the polynomials A n . The goal of this section is to 
show how this may be accomplished for 6 S for any S c N. 

As in Sects. 4.2.1 and 4.2.2, we want to find polynomials 

Ln = Y^ Z e^t\ x i W 

phn i=l 

that enumerate the run structure of the permutations 6 n+1 . This may be achieved 
in a two step procedure. Since every permutation has a unique decomposition 
into inextendible atomic permutations, we can enumerate the set of permutations 
according to this decomposition. The enumeration of permutations by their run 
structure follows because the enumeration of atomic permutations has already been 
achieved in Sect. 4.2.1. 

The key to our procedure is to understand the factorisation of the run structure 
into those of atomic permutations. Considering cr e <5 n as a word, we can write it as 
the concatenation a = a ■ n ■ co, where n is the principal atom of cr (see Sect. 4.1.3) 
and a, co are (possibly empty) subwords of cr. Since the decomposition of cr into its 
atoms also decomposes its run structure, the complete runs of cr are determined by 
the runs of a • 1, n and n ■ co if n is rising, or of a ■ n, n and 1 • co if n is falling. 

Let be the set of letters in co and define p : to be the involution 

mapping the i’th smallest element of S 0J to the i’th largest, for all 1 < i < |SJ. Then 
the run structure of n ■ co is identical to that of 1 • p(u>), where p(o>) is obtained 
by applying p letterwise to co. Furthermore, in the case n = 1 • • • n, the combined 
run structures of a ■ 1 and n ■ co are precisely the run structure of a ■ 1 ■ p(o>), while, 
if n = n ■ ■ ■ 1, the combined run structures of a ■ n and 1 • co precisely form the 
run structure of a ■ n ■ p(o>). We refer to a ■ 1 • p(o>) or a ■ n ■ p(co ) as the residual 
permutation. 

Summarising, the run structure of cr may be partitioned into that of n and 
either a ■ 1 • p(o>) or a ■ n ■ p(o>); accordingly, the monomial for cr factorises into 
that for the principal atom n and that for the residual permutation. Therefore, the 
polynomial enumerating linear permutations by run structure can be given in terms 
of the those enumerating atomic permutations of the same or shorter length and of 
linear permutations of strictly shorter length. 

This argument can be used to give a recursion relation for L n , which enumerates 
permutations of [n + 1] by their run structure. Taking into account that the principal 
atom consists of m + 1 letters, where 1 < m < n, of which m — 1 may be chosen 
freely from the set [2.. n], and that it might be rising or falling, and that the residual 
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permutation may be any linear permutation on a set of cardinality n — m + 1, we 
obtain the recursion relation 


L 


n 



Passing to the generating function, 


L 0 = 1. 


we may deduce that 


2(A) 




n =0 


A n 

nT’ 


32 

3A 


2sl(A)2(A). 


Our main result in this section is: 


(4.11) 


Proposition 4.8. The run structure of all permutations in <5 n+1 is enumerated by 



(4.12) 


where the sum is over all partitions p = p 1 + p 2 4- of n, \p\ is the number of parts of 

partition, ordp is the symmetry order of the parts of p (e.gfor p = l + l+ 2 + 3 + 3 
we have ordp = 2!2!) and (”) is the multinomial with respect to the parts of p. The 
generating function for the L n is 


oo -^n 

W) = I> n - = exp J 2 

n =0 


r ^ 


sd(ju) d/i 


(4.13) 


Proof. Equation (4.13) follows immediately from (4.11), as 2(0) = 1, whereupon 
Faa di Bruno’s formula [166, Eq. (1.4.13)] yields (4.12). □ 


To conclude this section, we remark that the first few L n are given by 

L 1 = 2A X 

L 2 = 4 + 2 A 2 

L 3 — 8A^ 4~ \2A 3 A 2 4" 2A 3 

L 4 = 16A* + 48 AIA 2 + 12A^ + \ 6 AiA 2 + 2 A 4 

L 5 = 32A 5 x + 160A^4 2 + UOA^l + 80 a\A 3 + 40A 2 A 3 + 20A 3 A 4 + 2A 5 
L 6 = 64A^ + 480A^4 2 + 320A^4 3 + 720A^ + 120A^4 4 + 480A 1 A 2 A 3 + 1207^ 
+ 24A 3 A 5 + 60A 2 A 4 + 40Ag + 2Ag. 
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Expanding the A k and writing the L n instead in terms of x n we obtain from these 

Li = 2x 3 

L 2 = 4x^ + 2x 2 

L 3 = 10x 3 + 12x 1 x 2 + 2x 3 

I 4 = 32x^ + 58x 2 x 2 + 12x^ + 16x 1 x 3 + 2x 4 

L 5 = 122x 3 + 300x 3 x 2 + 142 x 3 X2 + 94x 2 x 3 + 40x 2 x 3 + 20x 1 x 4 + 2x 5 
L 6 = 544Xj + 1682 x 4 x 2 + 568x 3 x 3 + 1284x^2 + 138x 2 x 4 + 556xjx 2 x 3 + 142x^ 
+ 24 x 3 x 5 + 60x 2 x 4 + 40x 2 + 2x 6 , 

which show no obvious structure, thereby making Prop. 4.8 that much more remark¬ 
able. 

4.3 Enumeration of valleys 

Instead of enumerating permutations by their run structure, we can count the number 
of valleys of a given (circular) permutation. Taken together, the terms C n involving a 
product of 2k of the x, relate precisely to the circular permutations (£ n with k valleys. 
Since any circular permutation in can be understood as a permutation of [3.. n+1] 
with a prepended 1 and an appended 2 ( cf. beginning of Sect. 4.2.3), C n may also be 
used to enumerate the valleys of ordinary permutations of [n — 1]. Namely, terms 
of C ri+ 1 with a product of 2(/c + 1) variables x ; relate to the permutations of 6„ with 
k valleys (i.e., terms of L n+1 which are a product of 2k of the x ; ). 

Let V(n, k) count the number of permutations of n elements with k valleys. Then 
we see that the generating function for V(n, k) for each fixed n > 1 is 



and we define K 0 (k) = 1. The first few K n are 


K^k) = 1 

K 2 M = 2 

K 3 (k) = 4 + 2 k 
K 4 (k) = 8 + 16k 
1C 5 (k) = 16 + 88k + 16k 2 
1C 6 (k) = 32 + 416k + 272k 2 


which coincide with the results in [183]. In particular, the constants are clearly the 
powers of 2, the coefficients of k give the sequence A000431 of the OEIS [164] and 
the coefficients of k 2 are given by the sequence A000487. Likewise, the coefficients 
of k 3 may be checked against the sequence A000517. In fact, the same polynomials 


appear in Andre’s work, in which he obtained a generating function closely related to 


(4.14) below; see [16, §158] (his final formula contains a number of sign errors, and 
is given in a form in which all quantities are real for k near 0; there is also an offset, 
because his polynomial A„(k) is our fC n _ 1 (K)). 
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Proposition 4.9. The bivariate generating function, i.e., the generating function for 
arbitrary n, is 


00 yll ^ 

JC(v,k) = YK nM - = 1 + - 
n\ k 


rv 


n =0 


^(.H)\ Xl =x 2 =-=^ 


and is given in closed form by 


3£(v, k) = 1-1---tan fv \J k — \ + arctan(l j\J k — 1)). 

K K 


(4.14) 


This result was found by Kitaev [138] and in the remainder of this section we will 
show how it may be derived from the recurrence relation (4.7) of C n . 

To this end, we first note that C n+1 satisfies the useful scaling relation 

A n+1 C n+1 (x 1 ,x 2 ,...,x n ) = C n+ i(Ax 1 ,A 2 x 2 ,...,A n x n ). 


Setting x ; = x/A = -Jk for all i, this implies 

A" +1 C n+ i(v^, • • •, Vk) = C n+1 {x, Ax,..., A n_1 x) 


and we find, by inserting the recurrence relations (4.7) and applying the chain rule, 
that with this choice of variables 

1 , dC n 0 dC n 

—^C n+1 (x,Ax,...,A n x x) = Ax —-1-x 2 —— + 2AC n . 

x A ox OA 

Hence, in turn, fC„(/c) = fc -1 C n+1 ( <Jk, ..., -Jk) satisfies the recurrence relation 

K n { k) = 2k(1 - kK'.jCk) + (2 + (n - 2)fc)JC n _ 1 (ic) (4.15) 

for n> 2. For the bivariate generating function 3fC this, together with K 0 = K 1 = 1, 
implies the p.d.e. 

dJC dJC 

(1 — vjc) —-h 2 k(k: — 1)—-[- (k — = k — 1, 

a v ok 


which is to be solved subject to the initial condition 9£(0, »c) = 1. 

The above equation may be solved as follows: first, we note that there is a 
particular integral 1 — 1/k, so it remains to solve the homogeneous equation. In turn, 
using an integrating factor, the latter may be rewritten as 


(1 — vk) 


d kJC 
dv f K — 1 


+ 2fc(fc — 1) 


k3E 


dK Vk- - 1 


= 0, 


(4.16) 


for which the characteristics obey 


dv 1 — VK 
dK 2 k(k — 1) 


Solving this equation, we find that 


v \f k — 1 + arctan 


Vk — 1 


= const 
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along characteristics; as (4.16) asserts that k3C/ Vk — 1 is constant on characteristics, 
this gives 

3£(v, k) = 1-1- -- / (v^/k — 1 + arctan(l / y/ k — 1)) 

KK 

for some function /. Imposing the condition 3£(0, x') = 1, it is plain that / = tan, and 
we recover Kitaev’s generating function (4.14). 

To close this section, we note that (4.15) has the consequence that fC n (l) = 
riK n _i( 1) for all n > 2 and hence that C n+1 (l,..., 1) = JC n (l) = n\ for such n, and 
indeed all n > 1, because C 2 (l,l)=fC 1 (l) = l. The generating function obeys 

GO yjl 

W)\ Xm =i = y ](n + 1)!— = (1 - A)- 2 
to n! 

for all non-negative A < 1 from which it also follows that 

^(A)| x .=i = (1 — A) -1 (4.17) 

(as Aj(l) = 1, we must take the positive square root) and hence A n | x<=1 = (n — 1)! 
for all n > 1. This gives a consistency check on our results: the coefficients in the 
expression for A n sum to {n — 1)!, the cardinality of 2l^ +1 , while those in C n sum to 
the cardinality of <t n . Furthermore, inserting (4.17) into the generating function 17(A) 
in (4.13), we find 

00 \ n 

se(A)| x#=1 = VL„( 1,..., 1)— = (1 - A)- 2 , 
to nl 

and thus L n+1 ( 1,..., 1) = n!, which is the cardinality of 6 n . 


4.4 Applications 


The original motivation for this work arose in quantum field theory, in computations 
related to the probability distribution of measurement outcomes for quantities such 
as averaged energy densities [90], One actually computes the cumulants K n (n e N) 
of the distribution: k 1 = 0, while for each n > 2, K n is given as a sum indexed by 
circular permutations a of [n] such that cr(l) = 1 and cr(2) < cr(n), in which each 
permutation contributes a term that is a multiplicative function of its run structure: 

K n = y *(o0 

<7 


where ^(cr) is a product over the runs of cr, with each run of length r contributing 
a factor y r . Owing to the restriction cr(2) < ci(n), precisely half of the circular 
permutations are admitted, and so K n = ^C n (y 1 ,y 2 ,--.,y n )- Thus the cumulant 
generating function is 


W(A) = y 

n=2 


A n 


n\ 


1 

2 


- A 

dpi (A - pO'gfpOl 
Jo 


1 

2 


dpi(A-piMpi)| 2 <=y< 


-y. 
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and the moment generating function is exp W(A) in the usual way. This expression 
makes sense a formal power series, but also as a convergent series within an appropri¬ 
ate radius of convergence. The values of y n depend on the physical quantity involved 
and the way it is averaged. In one case of interest 




r \ 

Yn = S n 

dk 3 d k 2 ■ ■ ■ d k n kik 2 ■ ■ ■ k n exp 

-h-l V|/c (ll -k,! -fc n 

* 

(M+) xn 

V i=l J 


2 2+r n _i 2 +r 2 n —1 

= 2 " s £ 

r„-i=0r n _ 2 =0 rj=0 fc=l 


= 2,24,568,20256,966592,... (n > 1) 

(the sums of products must be interpreted as an overall factor of unity in the case 
n = 1). Numerical investigation leads to a remarkable identity 

2 

s*Wlx.=y. = 1 _ m (conjectured) 

with exact agreement for all terms so far computed (checked up to n = 65). We 
do not have a proof for this statement, but the conjecture seems fairly secure. For 
example, we have shown above that A 5 = x 5 + 7x 3 x 2 + llx^x 1 + 5x 3 ; substituting 
for x n the values of y n obtained above, we find A 5 = 995328 which coincides with 
the fourth order coefficient in the expansion 

2 7c* A 4 

-- = 2 + 24A + 576— + 20736— + 995328— + 0(A 5 ). 

1-12A 2! 3! 4! 

In [90], this conjecture was used to deduce 

exp (W(A)) = e“ A/6 (l - 12A)“ 1/72 (conjectured), 


which is the moment generating function of a shifted Gamma distribution. The other 
generating functions of interest, with these values for the x k are 


^(^01 x.=y. 


4 

(1 — 12A) 2 ’ 


SS(A)| x>=y< = (1 — 12A) 1 ^ 3 (conjectured). 


For example, we have C 5 = 2x 4 x 1 + 6x 3 x 2 + 16x 3 x 2 = 165888 and L 5 = 122x 3 + 
300x 3 x 2 + 142x^2 + 94x 2 x 3 + 40x 2 x 3 + 20x 1 x 4 + 2x 5 = 3727360, to be compared 
with the terms of order A 3 and A 5 , respectively, in the expansions 


4 A 2 A 3 a 4 

---r = 4 + 96A + 3456— + 165888— + 995328— + 0(A 5 ), 

(1-12A) 2 2! 3! 4! 

A 2 A 3 A 4 A 5 

(1 - 12A)“ 1/3 = 1 + 4A + 64— + 1792— + 71680— + 3727360— + 0(A 6 ). 

2! 3! 4! 5! 

A natural question is whether there are other sequences that can be substituted 
for the x k to produce generating functions with simple closed forms. To close, we give 
three further examples, with the corresponding generating functions computed. The 
first has already been encountered in Sect. 4.3 and corresponds to the case x k = 1 for 
all k e N. 
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The second utilizes the alternating Catalan numbers: setting 


_ (-D fc 

* 2fc+1 “ ITT 



(fc > 0), x 2k = 0, (k > 1) 


and thus A 2k = 0, we obtain, again experimentally, 


^(A) = al(A) = 1, S£(A) = e 2A (conjectured) 


with exact agreement checked up to permutations of length n = 65. For example, one 
sees easily that with x 1 = 1, x 3 = —1, x 5 = 2 and x 2 = x 4 = 0, the expressions A k 
and C k given in Sects. 4.2.1 and 4.2.2 vanish for 2 < k < 6, and have A 1 = C 1 = 1, 
likewise, L k = 2 k for 1 < k < 6. 

Third, Andre’s classical result on alternating permutations (c/last and penultimate 
paragraph of Sects. 4.2.1 and 4.2.2 respectively) gives the following: setting 


x 1 = 1 and x k = 0, (k > 2) 


we have, using (4.8) and (4.13), 

&!(A) = sec A, ^(A) = sec 2 A, Sf(A) = (sec A + tan A) 2 . 


It seems highly likely to us that many other examples can be extracted from the 
structures we have described. 

Moreover, we remark that it is possible to implement a merge-type sorting algo¬ 
rithm, called natural merge sort [139, Chap. 5.2.4], based upon splitting permutations 
of an ordered set S into its runs, which are ordered (alternatingly in ascending and 
descending order) sequences S( c S. Repeatedly merging these subsequences, one 
ultimately obtains an ordered sequence. For example, first, we split the permutation 
542368719 into 542, 368, 71 and 9. Then, we reverse every second sequence 
(depending on whether the first or the second sequence is in ascending order): 542 
—> 245 and 71 —> 17. Depending on the implementation of the merging in the 
following step, this ‘reversal’ step can be avoided. Last, we merge similarly to the 
standard merge sort: 245 V 368 —> 234568, 17 V 9 —> 179 and finally 234568 
V 179 —> 123456789. Natural merge sort is a fast sorting algorithm for data with 
preexisting order. Using the methods developed above to enumerate permutations 
by their run structure, it is in principle possible to give average (instead of best- and 
worst-case) complexity estimates for such an algorithm. 



Quantum field theory 


Is the purpose of theoretical physics to be no more than a cataloging of 
all the things that can happen when particles interact with each other and 
separate? Or is it to be an understanding at a deeper level in which there are 
things that are not directly observable (as the underlying quantized fields 
are) but in terms of which we shall have a more fundamental understanding? 

— Julian S. Schwinger, “Quantum Mechanics” (2001), p. 24 f. 

First, in order to achieve the greatest possible generality we continue our 
total boycott of the canonical formalism [...]. 

— Bryce S. DeWitt, J. Math. Phys. 3 (1962), p. 1073. 




Locally covariant quantum field theory 


Summary 

In this chapter we will discuss the framework of locally covariant quantum field theory. 
In its present form it was introduced in [46] but many of its central ideas can already 
be found in earlier publications. It may be understood as a generalization of the 
Haag-Kastler axioms [109, 110] to curved spacetimes but it also differs in some subtle 
points because the Haag-Kastler axioms are ‘more global’ [see, for example, [30, 32] 
on the problem of gauge theories in locally covariant QFT). A generalization of the 
Haag-Kastler was performed by Dimock whose work [72-74] can be understood as 
the foundation of modern algebraic QFT on curved spacetime. Building on the work 
of Dimock, the paradigm of locally covariant quantum field theory should be seen a 
culmination of work done by Brunetti, Fredenhagen, Hollands, Kay, Verch, Wald and 
others on QFT on curved spacetimes, in particular renormalization, [44, 45, 123, 124, 
135, 215] around the turn of the millennium after the discovery of the microlocal 
spectrum condition [45, 181, 182], 

After discussing some general considerations leading to algebraic and locally 
covariant quantum field theory in the first section (Sect. 5.1), we will introduce the 
the general framework of locally covariant quantum field theory in the second section 
(Sect. 5.2). More details on the locally covariant framework may be found in the 
original publication [46] or also [89, 190] among many others. This is followed 
by an abstract study of the Borchers-Uhlmann algebra, the commutation relations 
and the field equation, which will lead to the field algebra, and the Weyl algebra 
in the third section (Sect. 5.3), we will discuss two free bosonic fields in the locally 
covariant framework: the scalar field and the Proca field. 


5.1 General considerations 

Quantum field theory is a very complex subject which cannot easily be defined. 
This is partly due to the fact that quantum field theory is not so much a physical 
theory but rather a language to formulate theories and models. However, a more 
important reason is that quantum field theory, even many decades after its inception, 
has no clear interpretation, e.g., it is often not clear what the physical objects are. 
Nevertheless, it can be considered one of the most successful scientific discoveries 
ever conceived and some predictions made by quantum field theory have been tested 
with astonishing precision. For example, the anomalous magnetic moment of the 
electron has been measured in agreement with theoretical predictions in the parts in 
a trillion range, see [18, 117]. 

The relation of QFT to other theories is schematically depicted in the diagram 
Fig. 5.1. In particular, QFT may be considered as a Lorentz invariant quantum 
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Figure 5.1 . The heuristic relation of quantum field theory with classical mechanics, 
classical field theory and quantum mechanics. 

mechanics in the case of infinitely many degrees of freedom. One can also argue that 
a consistent reconciliation of quantum mechanics with special relativity (in particular 
locality) leads invariably to QFT, i.e., fields and an infinite number of degrees of 
freedom are necessary, see [149] and also [52]. 


5.1.1 Lagrangian QFT 

Quantum field theory is usually formulated in the relatively heuristic approach of the 
Lagrangian formalism, where, starting from a classical Lagrangian, one imposes the 
canonical commutation relations between the quantized position and momentum vari¬ 
ables. In analogy to the quantum mechanical harmonic oscillator these yield creation 
and annihilation ‘operators’ on an abstract representing Hilbert space. Combining 
the creation and annihilation operator, one furthermore defines the quantum field. 
A specific Hilbert space is then selected by requiring that the annihilation operator 
annihilates a particular vector in the Hilbert space, the vacuum, so that one obtains 
the Fock space representation. 

Apart from not being mathematically rigorous, the Lagrangian formalism has 
several conceptual drawbacks. First, it neglects a priori the inequivalent irreducible 
representations of the canonical commutation relations (as a consequence of the 
failure of the Stone-von Neumann theorem in infinite dimensions) and instead selects 
the convenient Fock space representation. However, it is not obvious what is physical 
and whether the inequivalent representations are simply mathematical artefacts or 
physically relevant. Indeed, the existence of superselection sectors shows that the 
presence of inequivalent representations is certainly not irrelevant. A closely related 
issue is described by Haag’s theorem [110, Chap. II.1.1] which implies that the 
standard Fock space representation of the free theory is inequivalent to the that of 
the interacting theory. 

Second, the fundamental entities in the Lagrangian formalism are ‘operators’ 
at a point and thus neither mathematically not physically meaningful. Physically, 
because it would require an infinite amount of energy to localize a field a point. 
Mathematically, because a field at a point is not an operator on a Hilbert space but 
only an operator-valued distribution. Instead one should consider field which have 
no sharp localization but are smeared out over a region of spacetime. That is, the 
fundamental entities are quantum fields smeared with compactly supported test 
functions. 
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Third, the Lagrangian formalism contains global operators, like the particle 
number operator, which are not operationally meaningful because they cannot be 
reproduced by measurements in a bounded region of spacetime. In fact, due to the 
Reeh-Schlieder theorem, also local number operators cannot exist [110, Thm. 5.3.2], 
In this light the common interpretation of QFT in terms of localized particles is very 
problematic. On curved spacetimes or for accelerated observers the situation is even 
more problematic: as shown by the Unruh effect, the particle interpretation appears 
to be context dependent. 

5.1.2 Algebraic QFT and locality 

In the algebraic approach to quantum field theory, developed by Haag and collab¬ 
orators, the problems indicated above are addressed in a conceptually simple way: 
Rather than taking as observables operators on a Hilbert space, in the algebraic 
approach one discards the concrete representation of the operators and considers 
only the algebraic relations satisfied between the operators. Indeed, the relations 
between the observables already contain a large part of the physical content of the 
theory. 

The central pillar of algebraic quantum field theory is locality, better described 
by the German word “Nahwirkungsprinzip”. Locality means that causally unrelated 
events do not influence each other and it is implemented in the following way: To 
every spacetime region U we can associate a local algebra of observables sl(17) which 
can be measured within U. Consequently, we demand that map L/ ■—»s4(C7) forms an 
inductive system, i.e., it satisfies the isotony condition 

U c V => rf([7) c rf(V) 

or at least that there is an injective homomorphism £#([/) —* s!(V ); the correspondence 
U > &'!([/) for all U is called the net of local algebras. Further, we require that the 
local algebras of causally separated, causally convex regions (anti-) commute 

i/XV ==> [d([7),d(V)] = {0}. 

In the next section it will become clear how these conditions can be consistently 
imposed on different spacetimes. 

Following the choice of words of Haag [110], in any concrete case, the smeared 
quantum fields may be seen as a way to ‘coordinatize’ the local algebras. That is, 
they provide a map from test functions supported a spacetime region, to the local 
algebra supported in that region. From this point of view it seems clear that different 
quantum fields can lead to equivalent algebras. The notion of quantum fields might 
become clearer within the categorical framework to be introduced the next section. 

5.2 Framework 

5.2.1 Background structure 

The physical Universe appears to be well-modelled by a connected, oriented and 
time-oriented, four-dimensional Lorentzian manifold (M, g, ±, u), i.e., a spacetime as 
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/\ 

♦- \ / --X-* 






Figure 5.2. Example of an hyperbolic embedding and a non-hyperbolic embedding; 
see also Fig. 2.3. 


defined in Sect. 2.1. Moreover, for every possible observer to carry out experiments 
in a finite region of spacetime one has to require that J + (x) nJ“(y) is compact for 
all x,y e M. If we further require that no closed causal curves exist so that time 
travel is impossible, we have collected all necessary and sufficient conditions for a 
globally hyperbolic spacetime. Accordingly we consider as physical spacetimes all 
globally hyperbolic spacetimes. Of course we restrict ourselves to globally hyperbolic 
spacetimes also for technical reasons. In particular, only on globally hyperbolic 
spacetimes do we have a good understanding of the Cauchy problem for the wave 
equation. 

To implement simultaneously covariance and the principle of locality, i.e., an 
observer can conduct experiments in a globally hyperbolic subregion of the Universe 
and may remain ignorant about processes in the complement of that region, we 
consider as morphisms between globally hyperbolic spacetimes M,N the isometric 
embeddings t/» : M —> IV that are orientation and time-orientation preserving and 
whose image t/>(M) is a causally convex region of N. We call these morphisms 
hyperbolic embeddings; an example and a counter-example are shown in Fig. 5.2. If 
the image of the hyperbolic embedding xp contains a neighbourhood of a Cauchy 
surface in N, we say that it is Cauchy. 

The set of globally hyperbolic spacetimes as objects with the hyperbolic embed¬ 
dings as morphisms forms a category denoted Loc. This category was introduced 
in [46] and it is arguably the most fundamental but, as already mentioned in [46] , 
not the only possible choice to describe local theories. In fact, it has been altered in 
the literature in various ways 

• to accommodate more background structure by adding to the triple of manifold, 
metric and time-orientation, which is each object, additional elements like 
spin-structure [193], affine, principle and vector bundles [30-32] or external 
currents [2]; 

• to account for additional symmetry by allowing for more general morphisms 
like conformal isometries [1, 177, 5]; 

• to allow for the formulation of theories that are sensitive to the topology of the 
manifold, e.g., by restricting the set of objects to manifolds that have certain de 
Rahm cohomology groups [1, 5], 

More recently, it was suggested by Fewster [85] to consider as objects triples which 
instead of a metric have a global (co)frame; the morphisms are changed accordingly. 
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(a) 


(b) 


Figure 5.3. An illustration of (a) two causally disjoint embeddings and (b) a Cauchy 
embedding. 


The resulting category is larger and encompasses the original setting via a forgetful 
functor but the additional structure allows for an interesting discussion of the spin- 
statistics theorem. 


5.2.2 Observables 


A theory in this categorical framework is a covariant functor from Loc into a category 
whose objects describe physical systems and whose morphisms encode embeddings 
of physical systems. In quantum field theory (on curved spacetimes) in the algebraic 
formulation, physical systems are modelled by *-algebras or C*-algebras. Denote 
by *Alg the category whose objects are unital topological *-algebras with morphisms 
given by the unit-preserving *-monomorphisms and call a covariant functor si : 
Loc —* *Alg realizing such an algebra on each background in Loc a locally covariant 
theory. The algebra sOM) thus associated to each spacetime M e Loc is often called 
the algebra of observables although in many cases it may contain elements that are 
not actually physically accessible. 

Given two hyperbolic embeddings t/q : M, —* N, i = 1,2, such that the im¬ 
ages t/q(M ; ) are causally disjoint in N ( cf. Fig. 5.3(a)), we say that si is causal 
if 


[si(xp 1 )si(M 1 ),si('ip 2 )si(M 2 )] = {0}. 


Causality of si is closely related to its tensorial structure as discussed in [48] . 
The theory si obeys the timeslice axiom if 

si(xp)si(M) = si(N) 


for all Cauchy embeddings ip : M —> N (cf. Fig. 5.3(b)). The timeslice axiom is a 
prerequisite for the relative Cauchy evolution, which describes the response of the 
physical system to a perturbation of the background structure. 

More concretely, let (M, g, ±, u ) and ( M[h],g + h, ±, u h ) be globally hyperbolic 
spacetimes such that h is a compactly supported symmetric tensor field and u h is the 
unique time-orientation that agrees with u outside the support of h. 1 Consequently 
there exist neighbourhoods around two Cauchy surfaces in M [h] , one in the past 
of h and the other in the future. We can then find Cauchy morphisms t ± and t[fr] 
from spacetimes M ± e Obj(Loc) into M and M[h] as shown in Fig. 5.4. Together 

1 Note that a possible ‘perturbation’ is always M[0] but there exists also a neighbourhood U of 0 in 
the set of compactly supported symmetric covariant two-tensor fields (with the test function topology) 
such that ( M[h],g + h ) is globally hyperbolic for all hGU, see [27, Thm. 7.2]. 
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Figure 5.4. Illustration of the morphisms in the relative Cauchy evolution with the 
unperturbed background M on the left and the perturbed background M [ h ] on the 
right. 


these Cauchy morphisms make up the (*-algebra) homomorphism that is the relative 
Cauchy evolution map 

rce[h] = ^(t _ ) orf(i[/i]") _1 oi</(i[/i] + ) o 5/((, + ) _1 . 


It was shown in [46] that (in an appropriate topology, see [2, 98] for details). 


-2i —rc e[eh]A 
as 


£=0 


inhlA] 


for any A e and where T(K) e sf(M) is symmetric and conserved. Since T is 

both symmetric and conserved it may be interpreted as a stress-energy tensor [46, 
97, 98] and, in fact, in concrete models this interpretation is valid [28, 46, 93, 193], 


5.3 Generalized Klein-Gordon fields 

The Klein-Gordon field is usually the first field to be discussed when studying quantum 
field theory. We will be no different although we will perform some straightforward 
generalizations. Namely, we will quantize fields on natural vector bundles that satisfy 
a normally hyperbolic equation of motion. The results below are a generalization 
of those obtained in [46] for the scalar Klein-Gordon field. In principle, further 
generalizations of the definitions and statements presented below are possible. For 
example, one can replace compactly supported p -forms by compactly supported 
sections of ‘natural’ vector bundles, i.e., vector bundles that, like the (co)tangent 
bundle, are functorially constructed from the geometric structure of the manifold. 
However, all these generalizations yield little insight and obfuscate some constructions. 
Moreover, the requirements imposed by the usual locally covariant framework make 
it difficult to find examples that do not just use a standard tensor bundle tensorized 
with a vector bundle that is independent of the geometry of the spacetime. 

5.3.1 Borcher-Uhlmann algebra 

For every globally hyperbolic spacetime M, let 2 be a covariant functor from Loc 
into the category of closed nuclear locally convex C-vector spaces such that 2(M) c 
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12q(M, C) (for fixed p and with the subspace topology) and 3(ip) = xp*. 

On each spacetime M we can define a straightforward generalization of the 
Borchers-Uhlmann algebra [38, 39, 213] as the unital topological *-algebra 

S/(M) = © 3) (M)® n 

nG Nq 

with, t.e., the set of tuples (f n ) n e® with f n G 3(M)®' ! such that only a finite number 
of f n is nonzero, together with 

(a) addition and scalar multiplication is component-wise, 

(b) multiplication given by the canonical isomorphic embedding 

3 (M) Sm ® 3)(M) §n —» S(M) S(m+n) 

and extends (anti-) linearly to all of ?Z(M) via the canonical embeddings of 
these spaces into 2^(M), 

(c) a *-operation that acts on (/„) e 2ff(M) as (/„)* = (/*) and 

/*(x 1 ,...,xj = / n (x n ,...,x 1 ), 

(d) a topology given by the direct sum topology of the test function topology on 
each 2>(M)® n . 2 

Assigning to each globally hyperbolic spacetime the Borchers-Uhlmann alge¬ 
bra 2^(M), we obtain a covariant functor 2^ : Loc —* *Alg that maps each object to 
the algebra and each morphism to the *-algebra morphism generated by the natural 
pushforward i.e., 

^W0(/ n ) = (Vh/J with IpJn =fn° (t©)®" 
for all (/„) e ^(M). 

Considering 2f(M) as a topological *-algebra where the involution is complex con¬ 
jugation, we can consider it as a functor from Loc to *Alg. The natural transformation 
4> : 3> -=-» which for each M e Obj(Loc) is the canonical map 

£ m : S(M) -» ®(M), /~(0,/,0,...), 

is called the (locally covariant) quantum field associated to C U. Observe that every 
element in ^(M) is a limit of sums and products of <P M applied to test functions 
because ® n S(M)®' 1 is dense in ?/(M). 

2 With this topology the algebra is complete and nuclear, and the algebra product is separately 
continuous. 
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5.3.2 Field equation and commutator 

The Borchers-Uhlmann algebra carries no dynamical information and may therefore 
also be called the off-shell field algebra. In particular, the theory r U is neither causal 
nor does it satisfy the timeslice axiom. To obtain a causal theory that satisfies the 
timeslice axiom, we need to implement a field equation (an equation of motion) that 
induces a Cauchy evolution of the algebra elements and a commutator that ‘separates’ 
causally disjoint algebra elements. 

Therefore, we assign now to every globally hyperbolic spacetime M a natural, 
formally self-adjoint, Green-hyperbolic operator P M : G P (M, C) —* i? p (M,C) such 
that 

i°PM = PlV 0 Vb 

for ever hyperbolic embedding t p : M —* N. Moreover, we define a functor 2s as in 
the previous section. 

As discussed in Sect. 3.6, associated to the Green-hyperbolic operator P M , there 
exists on each globally hyperbolic manifold a unique causal propagator G M . 

The causal propagator is also called the commutator distribution or Pauli-Jordan 
distribution (see also Sect. 3.6.2) because it facilitates the definition of a commutator 
on the algebra ^(M). Namely, let /, h e 2(M) then we define on y/(M) 

= (iG M (/®h),0,...). (5.1) 

Due to the support properties of the causal propagator, this is exactly the right choice 
if one wants to implement Einstein causality. 

The commutator extends to arbitrary elements of 9/{M) in the following way: 
First, we notice that the commutator ought to satisfy the Leibniz rule. Therefore it 
may be seen as a map 

2(M)® n 0 2(M)® m —■» 2>(M)® (n+m_2) 

for n,m> 1, which can be extended (anti-)linearly to ® n 2s(M)® n , a dense subalge¬ 
bra of ?/(M ). Finally, we can extend the resulting commutator continuously to the 
whole algebra 2^(M). Thereby the algebra becomes a Lie algebra. 

The commutator is of immense physical importance. Foremost, it implements 
causality and manifests the principle of locality. Moreover, if the centre of the algebra 
of observables with respect to the commutator is non-trivial, the algebra contains 
unobservables and one cannot justify calling it ‘algebra of observables’. Nevertheless, 
non-trivial centres in the ‘algebra of observables’ can lead to important non-local 
observable effects under spacetime embeddings [194]. 

Note that the commutator (5.1) defined on ^(M) is degenerate if 2(M) flker G M 
is non-trivial and thus it leads to an algebra with a non-trivial centre. This problem 
will be addressed in the following section, where we introduce the so-called field 
algebra. 

5.3.3 Field algebra 

Then, taking the wave operator and the commutator, we can define the (on shell) 
field algebra fF(M) as the unital topological *-algebra given for every M e Obj(Loc) 
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by the quotient 

^'(M) = ^(M)/J r (M), 

where J^M) is the completion of the *-ideal finitely generated for all f,h& 2>(M) by 
(a) the wave equation 

^m(Pm/)~^m(0) 


(b) the commutator relation 


The topology of .^ (M) is the quotient topology with respect to C U (M ). 

Like for the Borchers-Uhlmann algebra, & : Loc — » *Alg defines a functor, where 
S^(M) is the field algebra and the *-algebra homomorphism 

W) [F] = W>*F] 

on all [F] e ^(M), which is naturally induced from ^(t p) via the canonical projection 
[ • ] : ‘j'Z(M) —> ^(M). That these assignments give indeed a covariant functor, relies 
on the naturality of all involved operators. In particular, 

V>*( p m/) = p jvO/’*/) and G m (/ ®h) = G N (ipJ 

for all xp : M — » N and /, fr e S*(M). Note that the field algebra is a Lie algebra, 
where the bracket is simply 

[[F],[H]]=[FH-HF]=[[F,H]] 

for all [F], [Ff] e .iF(M); the centre is trivial by construction. Moreover, we can 
construct a quantum field f> for .5F as the natural transformation f> : 2> -=-» which 
is given for each M e Obj(Loc) by 


4>M — [ ’ ] 0 $M- 

That is, cj) M is the map / —> [(0,/, 0,...)] for all test functions / e 2>(M). 

The following is a standard result, see e.g. [190, Chap. 3.1]: 

Proposition 5.1. The locally covariant theory &, given by the field algebra, satisfies 
both causality and the timeslice axiom. 

Proof. Since .iF(M) is the completion of the algebra generated by f> M , causality 
follows immediately from the support properties of the causal propagator G M . 

For the timeslice axiom we only need to show that the algebra in the whole 
spacetime can be reconstructed from the algebra in a causally convex neighbourhood 
N c M of a Cauchy surface; N may be considered as a spacetime in Loc with a 
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Cauchy embedding into M. Set % e C°°(M) such that % = 1 on J + (IV) \ N and and 
X = 0 on J“(1V) \ N. For every / e 3(M) there exists /' e 2>(M) given by 

/ / = Pm(*G m /) 


such that supp/' c N and 

f - f' = p m((1 - Z)G v ,m/ + ^G a M / ) e kerG M n 3{M) 

The statement follows again because is the completion of the algebra generated 

by 4> m . □ 

If to is a state on the Borchers-Uhlmann algebra ?/(M) for some spacetime M, it 
induces a state for the field algebra FF (M) if it also satisfies the commutation relation 

co(FH) - co(HF) = to([F,H]) 


and the equation of motion 

co ((id ® ® P M 0 ■■■ 0 id)F) = 0 

for all F,H e ?/(M). In that case, the state on ^(M) is defined by the pushforward 
of co by [ • ]. Conversely, a state on SF(M) always induces a state on ?/(M) by the 
pullback via [ • ]. 

5.3.4 Weyl algebra 

The disadvantage of the Borchers-Uhlmann algebra and the field algebra is that they 
are only ^-algebras and not C*-algebras and hence cannot generally be represented 
by an algebra of bounded operators. However, heuristically speaking, we can expo¬ 
nentiate the field algebra to produce a C*-algebra, the Weyl C’"-algebra introduced in 
Sect. 3.2.2. 

In this section, set 

S(M) = (M, C) / ker G M . 

3> extends to a functor from Loc into the category of closed nuclear locally convex 
C vector spaces because G M is continuous (so that ker G M is closed) and transforms 
covariantly under hyperbolic embeddings (c/ the previous section). 

Then define on every spacetime M e Loc the Weyl C*-algebra W{M) obtained 
from the Weyl operators W M : 2>(M) —* W{M) with commutator relation 

W M ([/])W M ([/t]) = exp (^G m ([/] ® [h]))W M ([f +h]) 

for every [f],[h] e 3 (M) and where we denoted also by G M the well-defined 
pushforward of G M to 3{M ) via the canonical quotient map. Again, it may be shown 
that W extends to a functor from Loc to *Alg by the covariance of the commutator 
distribution/ The proof that W satisfies both causality and the timeslice axiom is 
very similar to Prop. 5.1 and will not be repeated. 

3 To be precise, it is also necessary to show that the minimal regular norm behaves in a locally 
covariant way. This follows from the Hahn-Banach theorem. 
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5.3.5 Scalar Klein-Gordon field 

The (free, scalar) Klein-Gordon equation is 

(□ + fR + m 2 )ip = 0, (5.2) 

where <p e 'g(M) is the classical Klein-Gordon field and the parameters f and m > 0 
are the curvature coupling and the mass. 

One distinguishes in particular between two different curvature couplings: min¬ 
imal coupling if f = 0 and conformal coupling if f = 1/6. The reason for naming 
f = 1/6 conformal coupling is that, in the massless case m = 0, (5.2) is invariant 
under conformal isometries, see e.g. [64, 216], Namely, given a conformal embedding 
ip : M —> N with rp*h = Pl 2 g, one finds 

ip* (□ + Ir) p> = n 3 (□ + n~ l rp*ip, 

where ip e 'g(M). That is, if ip solves the massless conformally coupled Klein-Gordon 
equation on (IV, fr), then solves the massless conformally coupled Klein- 

Gordon equation on (M,g). 

The field algebra for the Klein-Gordon field can be constructed exactly as outlined 
above, where we choose 

P M = \3 + E,R + m 2 and S> V (M) = 2)(M, C), 

and denote the resulting functor and the quantum field <p. The Weyl algebra may 
be constructed in a similar way. In case of conformal coupling, the Klein-Gordon field 
can also be quantized as a conformally locally covariant theory [177, 5], 

5.3.6 Proca field 

The field equation for the classical Proca field Z e 12 1 (M) of mass m > 0 is 

(5d + m 2 )Z = 0 (5.3) 

and one can almost immediately see that it is not normally hyperbolic. However, 
applying the codifferential to this equation we find that 5Z = 0 so that (5.3) is 
equivalent to 

(□ + m 2 )Z = 0 


with the constraint 8Z = 0. 

There are two equivalent approaches to quantize this constrained system. Both 
rely on the same fact that the exterior derivative d and the codifferential 5 commute 
with (□ + m 2 ) and thus also with its causal propagator G M [94, 175], 

For first approach [55, 94] we notice that (5.3) is pre-normally hyperbolic so that 
we can construct the causal propagator 

G m = G m ° (m _2 d5 + id ). 
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Accordingly we can perform the construction of the field algebra discussed in 
Sect. 5.3.3 with this propagator. More precisely, we set 

P M = 5d + m 2 and 2> Z (M) = S?q(M, C) 

and follow through with the construction of the field algebra, where we denote the 
resulting locally covariant theory and the corresponding quantum field Z. 

The second possibility, which is related to the framework developed in [115], is 
to consider 


P M = n + m 2 and Ss^M) = {/e T?o(M, C) | 5/= 0} 

Since every section / e 2> Z (M) is coclosed, G M f solves the Proca equation (5.3). 
We can then perform the usual construction for the field algebra and denote the 
corresponding locally covariant theory &' z . 

Proposition 5.2. The locally covariant theories and TF' Z are equivalent. That is, 
cF z (M) ~ SF' Z (M) for every globally hyperbolic M. 

Proof. For every / e 9 Z (M) 

m 2 Z(J ) = Z ( m 2 f — (Sd + m 2 )/) = —Z(5d/) 

and thus Z (2> Z (M)) = Z (2^(M)) • Then it is easy to check that 
(5d + m 2 )f = (□ + m 2 )/ and G M f = G M f 


for all / e 3>' Z (M). □ 

Similarly, two apparently different Weyl algebras can be constructed for the Proca 
field and shown to be equivalent. 



Quantum states 

Summary 

In Sect. 3.2.1 we already introduced some general features of states on *-algebras. In 
this section we will discuss features important or specific to quantum field theory. 

We begin our discussion with the introduction of the n-point distributions 
(Sect. 6.1.1) associated to (some) states of the algebras defined above: the Borchers- 
Uhlmann algebra, the field algebra and the Weyl algebra. Of particular importance 
are states which satisfy the microlocal spectrum condition to be defined in Sect. 6.1.2. 
States which satisfy this constraint on the wavefront set are the so-called Hadamard 
states and their singular part is given by the Hadamard parametrix (Sect. 6.1.3). 

After we introduced these general notions, we will discuss the construction of 
quantum states on particular spacetimes. Due to their importance in cosmology and 
their relative simplicity, we discuss adiabatic and Hadamard states on cosmological 
spacetimes in Sect. 6.2. 

6.1 Preliminaries 

Let (M, g) be a globally hyperbolic spacetime and let us consider, as in 5.3, p-form 
fields. It is important to notice that none of the results here are fundamentally 
restricted to the assumption of p-form field and can be easily generalized. 

6.1.1 n-Point distributions 

Let ?/(M) be the Borchers-Uhlmann algebra of a quantum field theory on M which is 
built on a test function space 2*(M) C L2 q(M). Since 

«f(M) = 0 2s(M)® n , 

TIGNq 

the topological dual is of the form 

®'(M) = Y\ S'(M) Sn , 

72 £ N 0 

where we have used the kernel theorem. In other words, whereas any element of 
2^(M) can be understood as a polynomial, ^(M) also contains power series. It 
follows that the any state <x> on 2ff(M) is uniquely defined by a family (w n )„ eN of 
n-point distributions (also called n-point functions or Wightman functions) co n e 
3) '(M)®". If we denote by <P M the quantum field associated to ^(M), then the 
n-point distributions satisfy 

m n (/i »■■■»/„) = m($m(/i) ■ ■ ■ *m(/J) 
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for all fy, ...,/„ e 2>(M) and neN. 

The connected or truncated n-point distributions co^ of a state co are defined by 
the relation 

> * J = n W w( X r(D- • • • > *r(|r|))» (6.1) 

reP 

where S° n denotes the (ordered) partitions of the set {1,..., n}. Therefore, they can be 
calculated recursively from the n-point distributions. The first two truncated n-point 
distributions are 


w[(xi) = coi(xr), 

0>l(x i,x 2 ) = co 2 (xi,x 2 ) - mi(xi)a>t(x 2 ) 


and a general recursive formula is given by 
CO T n (x 1 ,...,X n ) = COn(*l, 

PeSf' n reP 
\P\>1 

Thanks to the close relation of the Borchers-Uhlmann algebra and the field 
algebra, see the last paragraph of Sect. 5.3.3, the space of states of the field algebra 
is related to a subspace of the space of states for the Borchers-Uhlmann algebra and 
a state to on the field algebra also has associated n-point distributions. These n-point 
distributions naturally satisfy the commutation relations 

0) n (X!, ...,X„) = tO n (Xy, X i+ y,X h . . . , X n ) 

+ to n _ 2 (x 1 ,. .., Xj, x i+1 ,..., 1)> 

where the hats denote omitted points, and are weak solutions of the equations of 
motion 

P M(*;)"nOi, ■■■,x i ,...,x n ) = 0 

for all i e [1.. n]. Therefore, if we denote by the quantum field associated to 
S^(M), the n-point distributions satisfy 


COnify ®---®f n ) = CO Om(/i) ■ ■ ■ # M (/n)) 


independently of the chosen representatives fy,...,f n G 2>(M) of [/J,..., [/„]. 

The definition of n-point distributions for a state on the Weyl algebra W(M ) is 
slightly more involved. n-Point distributions in the algebraic sense only exist for 
strongly regular states as defined in Sect. 3.2.2, see also [23]. In this case they are 
defined by the relation 

CO n (Jy »■■■»/„) = (-i) n 9 00 ( W M (ty [fy]) ■ • • W M (t n [/„])) 

dty-dt n t.= 0 

Clearly, the n-point distributions of a state on the Weyl algebra satisfy the commutation 
relation and the equation of motion. 

A state to is called quasi-free or Gaussian if all its truncated n-point distributions 
vanish for n/ 2, whence it is completely determined by its two-point distribution to 2 . 
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For a quasi-free state, all odd n-point distributions vanish and all even n-point 
distributions are given by 

<^O l5 • • • > * J = Xi ^OcrU)’ x cr(2)) ’ ’ ’ "20a(n—1> *a( n ))> 

(7 

where the sum is over all ordered pairings, i.e., over all permutations a € & n such 
that cr(l) < cr(3) < • • • < cr(n — 1) and cr(l) < cr(2),..., a(n — 1) < cr(n). 


6.1.2 Microlocal spectrum condition 

A quasi-free state to satisfies the microlocal spectrum condition [45, 181, 189] if 

WF(m 2 ) c {(x,x'; £, -£') e T*(M x M) | (x; £) ~ (x'; £') and ^ > 0}, (6.2) 

or, in words, the wavefront set of to 2 is contained in the set of (x, x'; £, — £') e 
T*(M x M) such that x, x' are connected by a lightlike geodesic y with cotangent E, 
at x and Ej is the parallel transport of E, to x' along y (in symbols: (x; E,) 
and E, is future directed (in symbols: E, > 0). That is, (x,x'\ E,, — £') is contained in 
the wavefront set if (x; £) and (x 7 ; E ,') lie on the same future-directed bicharacteristic 
strip generated by u(Ef) = — g(£, Ef). 

The microlocal spectrum condition can also be generalized to states that are not 
quasi-free [45, 191]. States that satisfy the microlocal spectrum condition are called 
(generalized) Hadamard states. 

Let P = □ + B, where B is a scalar function, the potential, and G(x, x 7 ) its causal 
propagator. If the kernel of the two-point distribution to 2 satisfies the commutator 
relation (weakly) 

m 2 (x, x') - to 2 (x', x) = iG(x, x') 


then equality of sets holds in (6.2). If, moreover, the two-point distribution is a 
parametrix of P M , i.e.a weak bisolution up to smooth terms, then it attains the local 1 
Hadamard form in a geodesically convex neighbourhood U c M 


o) 2 (x,x / )= lim 


e —8tt 2 VcTg^xO 
= W(x, X 0 + w(x, X0, 


iz(x,xO cr E (x,x') 

+ v(x,x )ln ■ 


A 2 


• w(x,x ') 


(6.3) 


where we take the weak limit, x, x' e U, A e K is arbitrary and the detailed form of 
the coefficients v, w e F(/\ P (TM) [3 /\ P (TM)) will be discussed in the next section. 
Above we used a ‘vectorized’ van Vleck-Morette determinant 

u(x,x') = A 1/2 (x,x')g [a ^(x,x') ■■■ g la r ]b r(x,x'), 


which is antisymmetrized in the indices a t and parallel transported along the geodesic 
connecting x and x', and the regularized world function 

cr £ (x,x ') = cr(x, x ') + is (t(x) - t(xO) + \e 2 

with t a smooth time function on (M, g) compatible with the time-orientation. 

1 Also a global Hadamard form can be formulated [136], but since the discovery of the microlocal 

spectrum condition this global form has lost its importance. In fact, it was shown in [182] that a state 
that is everywhere locally of Hadamard form is also globally a Hadamard state. 
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6.1.3 The Hadamard parametrix 


The coefficient functions 

/ 1 vh . fcr{x,x')\ k 

vO,X ) = ^X! Vfc ( X,X ^-J ( - 6 ‘ 4 ' ) 

in (6.3) are called Hadamard coefficients and are another example of bitensors. Note 
that the expansion above is an asymptotic expansion in terms of the world function a 
and cannot be expected to converge unless the spacetime is analytic. Although it can 
be turned into a convergent series by replacing the series (6.4) by [24, Chap. 2.5] 

n oo 

Xi v k (x,x')<j(x,x') k + ^ v k( x ,x / )j(a“ 1 cr(x,x / ))^ 
k=0 k=n +1 

for any n e N and some sequence (a k ), a k e (0,1], where y e C“(R) is 1 in a 
neighbourhood of 0 (note that we omitted the scale A), this will not concern us any 
further because we will only ever need a finite number of terms.^ Therefore, we also 
define the truncated local Hadamard parametrix 


W n {x,x'}= lim 


o+ 8u 2 \ cr e (x,x') 


u(x,x') , , , crAx^ 7 ) 

^+£\ fc (x,x>(x,xO k \n 


k =0 


A 2 


One can show that there always exists a n e N 0 such that 


lim 2(3K(x,x / ) - ^(x.xQ) =0 


for all differential operators 2 and n depends on the order of 2. 

The coefficients v k can be recursively calculated by (formally) applying P to 5?; 
One then finds the so-called Hadamard recursion relations ( cf. [69, 96, 169]) 

A 2 Piz = (2V a + G a a - 2)v 0 , (6.5a) 

A 2 Pv fc _! = (2V £J + cr a a + 2 k- 2)fcv fc , (6.5b) 


where we have used the transport operators defined in (1.12). It can be shown 
that the Hadamard coefficients are symmetric in their arguments [103, 155]. To¬ 
gether with the first term in (6.3) the Hadamard coefficients make up the Hadamard 
parametrix ^(x.xQ, which is therefore completely determined by the differential 
operator P and the geometry of the spacetime. 

The covariant expansion of the Hadamard coefficients can be efficiently cal¬ 
culated using the Avramidi method described in Sect. 1.4.4 using the transport 
equations (6.5). If one is only interested in the coincidence limits, one can directly 
take the limit in (6.5) to find (omitting necessary Kronecker deltas originating from 
coincidence limits of parallel propagators) 



1 

2 



[v fc ] = 


1 

2(1 - k)k 


[Pvik-rL 


k> 1. 


2 If we inserted this modification into (6.3), the two-point distribution would not be an exact (weak) 
solution of P any more, but only up to a smooth biscalar, i.e., it would only be a parametrix. 
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Note that [v 0 ] vanishes for a conformally coupled massless scalar field. The coinci¬ 
dence limit of v 1 cannot be found in this easy way and must be calculated directly. 
After a lengthy calculation using pencil and paper or a (fast) calculation using a 
tensor algebra software, one obtains (again omitting Kronecker deltas) 



Different from v(x, x'), the symmetric bitensor w(x, x') is not directly determined 
by the geometry or a differential operator. Instead the term w(x, x') reflects the 
freedom in the choice of the state. Writing the asymptotic expansion 



k =0 


we notice that the freedom to choose a state is completely encoded in the first 
coefficient w 0 and the remaining coefficients obey the recursion relation [69, 96] 


A 2 Pw fc — 2 (k + l)V (J w fc+1 + 2 k(k + l)w fc+1 + (k + l)w k+1 cr a a 
+ 2V cr v fc+1 - 2(2 k + l)v fc+1 + v k+1 cr a a . 


A common choice is to set w 0 = 0 as in [219]. In any case, w 0 must be chosen such 
that w is symmetric. 

6.2 Construction of states on cosmological spacetimes 

Explicit examples of quantum states are known only on a small class of highly 
symmetric spacetimes. Below we will first discuss the so-called Bunch-Davies state 
[10, 51, 198], which can be considered the vacuum state of de Sitter spacetime. Then 
we study a construction of states on FLRW spacetimes due to Parker [172] called 
adiabatic states. Although adiabatic states are in general not Hadamard, indeed only 
adiabatic states of infinite order satisfy the microlocal spectrum condition [134], 
they can be considered approximate Hadamard states and have proven to be very 
useful thanks to their relatively straightforward construction. Since we will make 
extensive use of adiabatic states when we discuss the semiclassical Einstein equation 
on cosmological backgrounds in Chap. 8, they will be treated in some detail below. 
Moreover, we will introduce the states of low energy by Olbermann [165], which are 
constructed via a careful Bogoliubov transformation from adiabatic states. 

6.2.1 Bunch-Davies state 

A distinguished Hadamard state for the (massive) scalar field on de Sitter spacetime 
is the Bunch-Davies state [10, 51, 198]. It is the unique pure, quasi-free Hadamard 
state invariant under the symmetries of de Sitter spacetime. Note that equations of 
motion for the scalar field on de Sitter spacetime are 


□</> + (12£H 2 + m 2 )cp = 0, 


where H is the Hubble constant, m the mass of the scalar field and c the curvature 
coupling, cf. Sect. 5.3.5. Therefore the curvature coupling £ acts like a mass and 
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we set M 2 = 12£H 2 + m 2 . The Bunch-Davies state is also a Hadamard state in the 
limit M = 0 but in that case it fails to be invariant under the symmetries of de Sitter 
spacetime [10, 11]; below we assume M > 0. 

Using the function Z defined in (2.15), the Bunch-Davies state is the quasifree 
state given by the two-point distribution 

co 2 {x,x ) = co 2 (Z(x,x )) = ----- 2 Fi(v + ,v_;2;i(l + Z)), (6.7) 

8ttcos(Ttv) 1 

where 2 F\ is the analytically continued hypergeometric function and with 

3 [9 M 2 

v+ = - ± v and v = \ - 

± 2 V 4 H 2 

We can rewrite (6.7) into a form which exhibits the Hadamard nature of the state 
more clearly. In fact, using well-known transforms [166, Eq. (15.8.10)] of the 
hypergeometric function 2 F 1 , one can show 

zj 2 jt/r2 _prr2 . . 

0> 2 (Z) = — 2(1 - Z)- 1 + 2 fv(i( 1 - Z)) In (1(1 - Z)) + w(|(l - Z))) 

8tt 8 tc v. z z z J 

for |Z| < 1 (spacelike separated points) with 


v(z) = 2 f’iO + ,v_;2;z), 

w(z) = V v + + fc ) + F( v - + fc) - f (k + 1) - f{k + 2)) z k , 

t=Z fc!(fc+ 1)! 


where f is the digamma-function. 

In the cosmological chart of de Sitter spacetime the function Z attains the simple 
form (2.17) and the spatial sections are flat. Therefore, one can represent a> 2 as a 
spatial Fourier transformation with respect to x — x'. Indeed, using known integrals 
of (modified) Bessel functions [107, §6.672], a lengthy calculation shows [198] 
(omitting again the e-prescription) 


co 2 (x,x') = 


H 2 (tt / ) 3/2 


32-nZ 


- 1 tImv H (D(_ fcT)H @(_ fcT /) e ifc-(x—xO d £, 


( 6 . 8 ) 


where x = (t, x) and x' = (t x , pc') in the conformal coordinates and H®, H® are the 
Hankel functions of first and second kind. 


6.2.2 Homogeneous and isotropic states 

It is usually reasonable to restrict ones attention to states that respect the symmetry 
of the background spacetime. Under this assumption, a state on a FLRW spacetime 
should be both homogeneous and isotropic. That is, if the state is also quasifree, its 
two-point distribution needs to satisfy 

co 2 (x, x') = co 2 (t, t', x-x) = co 2 { t, t', x - x), 


3 More precisely, one should replace Z by Z + ie(t(x) — t(x')), where t is a smooth time function, 
and take the limit e —» 0 + . Note that 2 F1 has a branch cut from 1 to 00. 










6.2. Construction of states on cosmological spacetimes 121 


where x = (t, x) = (t, x ) and x' = (V, x') = (t'', x ') with respect to cosmological or 
conformal time. 

Under a certain relatively weak continuity assumption on the two-point distribu¬ 
tion (such that they may be represented as bounded operators on a certain Hilbert 
space and the Riesz representation theorem can be used [146]), it was shown in 
[146, 196] that every quasifree, homogeneous and isotropic state for the scalar field is 
of the form 4 5 


m 2 (x,xO = 


(2tt) 3 a(T)a(T / ) 


(S(fc)S fc (T)S t (T / ) 

3 

+ (S(fc) + l)S fc (T)S fc (T')) dfc, 


(6.9) 


where k = |k| and S(fc) is a non-negative (almost everywhere), polynomially bounded 
function in L 1 (Rt); for pure states 5 = 0. Moreover, the modes S k are required to 
satisfy the mode equation of motion 


(d 2 + col)S k (t) = 0, col = k 2 + (£- |)a(T) 2 R + a(T) 2 m 2 , (6.10) 

and the Wronski-determinant condition 1 


s k s' k 


s' k s k = i, 


( 6 . 11 ) 


where both S fc and S' k are polynomially bounded in k. States constructed in this 
manner are in general not of Hadamard type. 

Two important examples of pure Hadamard states expressible in the mode form 
above are the Minkowski vacuum state on Minkowski spacetime 


1 

(2u) 3 



1 

2 E(fc) 




with E(k) 2 = k 2 + m 2 , and the Bunch-Davies state on the cosmological patch of de 
Sitter spacetime (6.8). Interesting examples of non-pure states are the approximate 
KMS states at inverse temperature 13 for the conformally coupled scalar field [57] 


(2tt) 3 a(T)a(T / ) 


S k ('t)Sk(T') 
Ql3k F — ^ 


1 — e~P k F J 


e ik-(x-x'J 


with k F = \/k 2 + a(T F ) 2 m 2 for some ‘freeze-out’ time t f . These states are KMS states 
if the spacetime admits a global timelike Killing vector field which is a symmetry of 
the state; they are Hadamard states if the pure state specified by the modes S k is 
already a Hadamard state [57]. 

Given fixed reference modes x k that satisfy (6.10) and (6.11), all other possible 
mode solutions S k can be constructed via a Bogoliubov transformation, i.e., 


Sk= A ( k )Xk + B(k)x k with |A(k)| 2 - |B(k)| 2 = 1, 


4 Note that we omit here and below the necessary e-regularization of the integral, where one 
multiplies the integrand with e~ ek and considers the weak limit e —» 0 + . 

5 Imposing the Wronski-determinant condition guarantees that the imaginary part of the two-point 
distribution is given by half the commutator distribution. It is sufficient to impose this condition at one 
instance in time. 
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where A(k) and B(k) are such that k —* S fc and fc >-» S' k are (essentially) polynomially 
bounded, measurable functions. Note that changing S k by a phase does not affect 
the state and therefore B(k) can always be chosen to be real. The choice of A and B 
thus corresponds to two degrees of freedom, e.g., the phase of A and the modulus 
of B. If the reference modes % k specify a pure Hadamard state, one can show that 
the modes S k with the mixing H(k) specify a Hadamard state as well if and only if 
(in addition to the conditions above) k n B{k ) and k n S(k ) are in L 1 (Mq ) for all n e N 
and ArgA — Arg B is measurable [178, 221], An important example of a Bogoliubov 
transformation of a Hadamard state that (clearly) does not give a Hadamard state are 
the a-vacua associated to the Bunch-Davies state for which A = sinh a and B = cosh a 
with a € R. 


6.2.3 Adiabatic states 


Any solution of (6.10) and (6.11) is of the form 6 


SfcO) 


Pk ^ piS/cM 

V2 


with 


dk( T) 


f T 


p fc (rj) 2 dp, 


( 6 . 12 ) 


where p k satisfies the differential equation 

p'k = (pfc 4 - w DPfc = K 2 - <4)Pk- ( 6 - 13 ) 

Modes W k = a k e 1 ^ k /V2 of the form (6.12) with arbitrary cr k that do not satisfy the 
differential equation (6.13) can be used to specify initial values for solutions S k of 
the mode equation (6.10), i.e., 


S fc (To) = W fc (To), S'(To) = W fc '(T 0 ) 


at some initial time t 0 . Using an idea of Parker [172], it can be shown that the W k 
yield the solution S k via a Bogoliubov-like transformation 

S fc (T)=A(T)W fc (T)+B(T)W fc (T) (6.14) 


with time-dependent coefficients given by 


A(t) = 1 — i 


r t 

G(p) (^A(p) + B(p) e~ 2l ^ k ^ v ^ dp, 


J T 0 


B( t) 


rr 

- A'(v)e 2 ^ kM dr] 

J T 0 


(6.15a) 

(6.15b) 


and 2 G = a ~ 2 — co 2 a 2 — a k a ", where we have suppressed the k-dependence of A,B 
and G in all four equations above. Applying arguments from the analysis of Volterra 
integrals, it can be shown that 1 — A, A', B and B' have the same large k behaviour 
as G ( cf. [146, 165]). As a consequence, the modes S k and their derivatives have 
(almost) the same asymptotic behaviour as the modes W k if cr k looks asymptotically 
like a> k lpl : 

6 The lower limit in the integration is arbitrary as it gives a constant phase. 
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Proposition 6.1. Suppose d\c 7 k = d'{ a> k 1 ^ 2 ) + 0{k 9 ^ 2 )for all l = 0,1,2 such that 
G = 0{k~ m ) for some m> 3. The modulus p k of the modes S k satisfies 

Pk = a k + 0 (/r 1 / 2 --), 8fp k = 8fa k + O {k 3 ' 2 ~ m ) 


for all n = 0 , 1 , 2 . 

Proof. First note that the assumed bounds of cr k and cr" yield 
2 G = a- 2 -co 2 a 2 -a fc < = 0(fc- 3 ), 

which is consistent with the assumption on G. Further, recall that l — A, A', B and B' 
are 0{k~ m ), too. We then derive from (6.14) that 

p 2 = |A + £e -2i ^ fe | 2 (7 2 = a 2 k + 0{k~ 1 ~ m ) 

and thus p k = <j k + 0(k _1 / 2 ~ n ). Next we use that p k satisfies the differential 
equation (6.13) to find 

p k -cr , ' = ( < pf 4 -aj 2 k )p k - {erf 4 — co 2 )cr k + 2Gaf 1 = 0(fc 3/2_m ), 


whereby we obtain the estimate for p'f and, after an integration in time, also that 
for p[. □ 


In summary, the asymptotic behaviour of the initial values given by co k fixes the 
asymptotic behaviour of the solutions to k . 

We can now construct adiabatic states as in [146, 172] by specifying appropriate 
initial values for (6.10) respectively (6.13): Making a WKB-like Ansatz, one finds the 
adiabatic modes of Parker [172], Namely, the adiabatic modes W fc = Wjf^ of order n 
are modes of the form (6.12) with a k = \ given iteratively via 7 


(crE n + 1 ))“ 4 A co 


&0 


+ * with (afr 4 = co 


cr 


k- 


The adiabatic modes Wf 1 ’ are then used to specify initial values for the mode equa¬ 
tion (6.10), e.g., by solving the integral equations (6.15). 

A useful result on the asymptotic behaviour of the adiabatic modes is stated in 
[146, Lem. 3.2], Using the fact that co' k = 0(k -1 ), one can easily improve this lemma 
to obtain for all n e N 0 and m € N 


o-W = 0(k- 1/2 ), 
Gf = 0{k- S ' 2 ), 


ef = 0[k~ 2{n+l) ). 


dr r ‘ 


4 n) = 0(k“ 2( ' 1+1) ) 


(6.16) 


as k —* oo and where (cr^) 4 = {erf ^) 4 (1 + eff). The asymptotic behaviour of the 
coefficients (6.15) for adiabatic states was analyzed in [146, 165], It can be found, 
using the improved bounds (6.16), that they satisfy 

1 —A{k, t) = 0(k -2n-3 ), B{k, t) = 0(k -2n-3 ), 

A'(k, t) = O {k ~ 2n ~ 3 ), B'{k, t ) = O {k~ 2n ~ 3 ) 


7 The notation used here can be transformed into the usual one by setting erf) = (fijA) 1 ^ 2 . 
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as k —» oo. 

These results can be seen as a starting point to show the relation between adiabatic 
states of a certain order and Hadamard states. Indeed, one can show [134] that 
adiabatic states of infinite order are Hadamard states and that adiabatic states of a 
finite order satisfy a microlocal spectrum condition on a certain Sobolev wavefront 
set 8 of the two-point distribution. 


6.2.4 An adiabatic state for conformal coupling 

Let study the construction of the (adiabatic) states already considered in [178, 3], see 
also [14, 15]. For a (massive) conformally coupled scalar field (f = 1/6) the initial 
values for an adiabatic state of order zero can be taken to be 

Z fc (T 0 ) = — L e ifc ° T °, *'(To) = -p= e ifc ° T °, 

\fZko ^/2ko 

with kg = (12®) 2 = k 2 + a(T 0 ) 2 m 2 . Note that these initial values are essentially a 
conformal transformation of the modes of the Minkowski vacuum. 

It is possible solve the mode equation (6.10) with these initial values perturba- 
tively. For this purpose, define the potential V(t) = m 2 (a( t) 2 — a( t 0 ) 2 ) and make 
the recursive Ansatz Xk( z ) = %^\z) with the recurrence relation 

ir(T) + ^f(T) = -V(t)li"- 1) (T) (6.17) 


with initial condition 


zf(T) = ^e ifc ° T 


\J 2 ko 


(6.18) 


The mode equation (6.10) is then solved as described in the proof of the following 
proposition: 

Proposition 6.2. The recurrence relation (6.17) is solved iteratively (for z > r 0 ) by 


r T 




sin (k 0 (i? - t)) 


V(v)Xk ®0j) dl 7 


(6.19) 


TO 


and the sum j^(t) = J] n x^\z) converges. 

Proof. Consider for each k the retarded propagator of <9 2 + k g given by 


A-et,k(/)Oo,T) = 


sin(ko(T - rj)) 


/ (rj)drj, t>t 0 , 


T 0 


for all / e C°(/), where I is the domain of the conformal time. Applying A ret k 
to (6.17), it can be solved as 


r t 




sin (k 0 (r? - t)) 


v(v)Xk 'Xrid'n 


T 0 


8 Sobolev wavefront sets are very similar to the usual wavefront set, but instead of using smooth 
functions at the foundation of the definition, one uses functions from a Sobolev set of a certain order. 
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for t > t 0 . 

The straightforward estimates 

r T r T 

\x^\<m 2 (T-rj)|y(r))^ n_1) (tj)|d 17 

T 0 ^ T 0 

can be iterated using the initial value xf 1 = ( 2 /c 0 )~ 1/2 and the standard ‘trick’ of 
extending the integration of a symmetric function over a time-ordered domain to an 
integration over a symmetric domain by diving through the appropriate factorial ( cf. 
[178, Prop. 4.4]). Combining the two estimates, this gives 

,, 1 f m 2 f 

I4°l < ~j=^ T- 

\/2k 0 n\ V k 0 J T 

for any 0 < l < n. Therefore the sum Xk(. T ) = Xm xjfC'c) converges absolutely. □ 

Equivalent results can be found in [3, Sect. 2.1], [178, Prop. 4.4] and also [14]. It 
is clear, that the recurrence relation can be solved in the same for t < t 0 by applying 
the advanced propagator. 

Remark 6.3. The partial modes xcan be computed as in Prop. 6.2 even if the metric 
(equivalently the scale factor) is not smooth. If the scale factor is C°, the resulting 
mode Xk w M be at least C 2 . This relies crucially on the fact that the curvature, which is 
not well-defined if a is not at least C 2 , does not enter the mode equation (6.10). 


|v(tj)| drj 


m 


- T 

(T-i?)|v(i7)|dp 


J T 0 


( 6 . 20 ) 



6.2.5 States of low energy 

Let us define the (unregularized) energy density per mode S k as 

P(Sfc, Sk) = A (SfcS; + (6f - 1) aH(S k S k y 

2a y ( 6 . 21 ) 

+ (k 2 + a 2 m 2 - (6f - 1 )a 2 H 2 )S k S k J. 

For now, we will not interpret this quantity in any way and leave its derivation to 
Sect. 7.3. 

Given reference modes Xk and Bogoliubov coefficients A, B, the energy density 
per mode S k = Axk + BYk i s related to the energy density per reference mode Xk by 

l(p(S k ,S k )~ p(Xk,Xk)) = \B\ 2 piXk,Jk) + B.e(AB p(xk, Xk)) (6.22) 

One can now attempt to minimize the energy density per mode by varying the 
Bogoliubov coefficients and we find that: 


Proposition 6.4. The energy density per mode at a fixed instance of time is minimal if 
and only if the Bogoliubov coefficients are given by (up to unitary equivalence) 9 


Arg A(k) = tt - Arg (p(* fc , Xk))> 
p(Xk,Xk) 


B(k) = 


2 \/p{Xk,Xk) 2 - \p(Xk,Xk)\ 2 


l 


1/2 


(6.23a) 

(6.23b) 


9 Recall that, without loss of generality, we can always choose B to be real and positive such that A 
is completely determined by its phase. 
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and p(% k ,Xk) 2 > \p(Xk> Xk) I 2 - The inequality is satisfied for allk> 0 if and only if 
I 4 m 2 

y 1 + -^ 2 - ± 1 > ±12? or H = 0, (6.24) 

i.e., in particular whenever 0 < ? < 1/6. 

Proof. For fixed B > 0, (6.22) is minimized by ArgA = tt—Arg (p(Xk> Zfc)) so that the 
second summand is maximally negative. Therefore, minimizing (6.22) is equivalent 
to finding the minima of 

B 2 p(Xk,Xk) + ^ (ABp(xk, Xk)) =B 2 p(Xk,Xk)~ V 1 +B 2 B\p(x k , Xk)\- 

Differentiating this expression by B, one finds that an extremum exists on the positive 
real axis only if p(Xk’Yk) 2 > I p(.Xk> Xk) I 2 and that its locus is given by (6.23b); it is 
easy to see that this is indeed an minimum. 

Inserting the definition (6.21) into the condition p(Xk>~Xk) 2 > \p(.Xk> Xk)\ 2 > w e 
find that it is equivalent to 

k 2 + a 2 (m 2 + 6(1 - 6 f)fH 2 ) > 0. 


If this conditions is to hold for all k > 0, then (6.24) must be satisfied. □ 


Instead of trying to minimize the energy density per mode at an instant, states 
of low energy are constructed by minimizing the smeared energy density per mode. 
That is, by minimizing 


/(ir) 2 (p(S fc ,S fc ) - p(Xk,Xk)) dT 
Jl 


r 

/(T) 2 ^|B| 2 p(^,Yfc) + Re(ABp(j fc ,j fc ))) dr 
J I 


(6.25) 


for a fixed smearing function / e C“(l), where Id R is the domain of the conformal 
time coordinate. 

This minimization was performed for minimally coupled scalar fields in [165] to 
find the so-called states of low energy. It can be shown that the states of low energy 
satisfy the microlocal spectrum condition and thus they are Hadamard states. The 
arguments presented in [165] can be straightforwardly repeated for the conformally 
coupled scalar field to find states of low energy, which are Hadamard states too. In 
both cases the Bogoliubov coefficients are given as in (6.23) with the replacements 


p(Xk,Xk) 


r 

f^) 2 p(Xk,Xk) dT and p(Xk,Xk) 
Jl 


r 

f^) 2 p(Xk,Xk) dT. 
Ji 


There are good reasons to believe the following: 


Conjecture 6.5. States of low energy for arbitrary smearing function, mass and scale 
factor exist only in the curvature coupling range 0 < ? < 1/6. For all such ? the state 
satisfies the microlocal spectrum condition. 
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The crucial point in proving this conjecture is to show that (6.25) has a minimum. 
This can be shown similar to [165] for the minimally coupled case and the conformally 
coupled case. For other values of £ the proof is more difficult, but by continuity it is 
clear from Prop. 6.4 that for some smearing functions states of low energy exist in 
the interval E, e [0,1/6] but not outside that range. Namely, if (/„) is a sequence of 
functions such that f 2 converges weakly to the delta distribution, then there exists N 
such that a state of low energy exists for all f m , m> N because a minimum exists for 
f 2 = 5 by Prop. 6.4. Once existence is show, one can expect that the state satisfies 
the microlocal spectrum condition using proofs analogous to those in [165], 


6.3 Holographic construction of Hadamard states 

In the absence of a global timelike Killing field on a generic globally hyperbolic 
spacetime it is difficult to find physically well-motivated quantum states. Therefore, 
in recent years, a lot of effort was put into the construction of proper Hadamard 
states on non-trivial spacetimes. A promising method is the ‘holographic’ construction 
of Hadamard states on characteristic surfaces introduced in [64, 157, 159]. The holo¬ 
graphic method has been applied to construct Hadamard states for the conformally 
coupled, massless scalar field [64, 157, 159], the Weyl (massless Dirac) field [58, 
112], the vector potential [1, 5] and linearized gravity [29] on asymptotically flat 
spacetimes and cosmological backgrounds [61, 62], It was also used to construct 
local Hadamard states in lightcones in [65], 

Forgetting for a moment the application of the bulk-to-boundary construction to 
asymptotically flat spacetimes and limiting ourselves to the scalar field, it may be 
roughly sketched as follows (see also [106]). Let (M, g) be a globally manifold with 
a distinguished point p such that the future lightcone of p satisfies some technical 
conditions. The it is possible to construct on the lightcone (without the tip and as 
a manifold on its own) a positive bidistribution A on all functions on the lightcone 
that are compactly supported to the future and falls off sufficiently fast to the past, 
such that the antisymmetric part of A agrees with the pullback of the commutator 
distribution on the whole spacetime, and the wavefront set of A is of positive frequency 
with respect to the future-directed lightlike geodesics through p. This bidistribution 
has all the necessary properties to define a state for a quantum field theory on the 
lightcone. Moreover, taking any compactly supported function in the interior of the 
lightcone of p, it can be mapped to a function on the lightcone using the advanced 
propagator and a pullback such that the resulting function on the lightcone is compact 
towards the future and has good fall-off properties towards the past of the lightcone. 
This way one obtains the so-called bulk-to-boundary (projection) map. Pulling back 
all functions in the interior of the lightcone to the boundary of the lightcone using 
this map, one thus finds a state for the scalar field restricted to the interior of the 
lightcone. Applying the propagation of singularities theorem it is possible to show 
that the resulting state satisfies the microlocal spectrum condition. 

In a second step, on may construct Hadamard states for conformally invariant 
scalar field on asymptotically flat spacetimes with globally hyperbolic unphysical 
spacetimes. First, one notices that boundary of the conformal completion of an 
asymptotically flat spacetime in the unphysical spacetimes satisfies all the necessary 
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technical conditions. Then one can compose the bulk-to-boundary map inside the 
unphysical spacetime with the (non-unique) conformal transformation associated 
with the asymptotically flat spacetime, to find a state for the conformally invariant 
scalar field. Since conformal transformation leave lightlike geodesics invariant, also 
this state is of Hadamard form. 

In [1, 5] this construction was generalized to the electromagnetic vector potential. 
Also in this more complicated case a bulk-to-boundary construction of Hadamard 
states was be found, but it involves careful use of the gauge freedom of the vector 
potential to construct a positive state. Otherwise, the construction remains largely 
unchanged. 


Semiclassical gravity 


It is shown in a quite general manner that the quantization of a given 
system implies also the quantization of any other system to which it can be 
coupled. — Bryce S. DeWitt, 

in “Gravitation: An Introduction to Current Research” (1962), p. 272 

Oh gravity, thou art a heartless bitch. 

— Sheldon Cooper, Season 1, Episode 2, The Big Bang Theory 





The semiclassical Einstein equation 


7.1 Introduction 

The equation 

G ab + Ag ab = co(:T ab :) (7.1) 

is called the semiclassical Einstein equation. 1 It is obtained from the ordinary Einstein 
equation by replacing the classical stress-energy tensor with the (normal ordered) 
expectation value of the stress-energy tensor of a quantum field in a suitable quan¬ 
tum state co. Many developments in quantum field theory on curved spacetimes 
were driven by problems related to the quantum stress-energy tensor. See also the 
monographs [36, 104, 112, 217] for an overview of the subject. 

The semiclassical Einstein equation is usually understood as an equation that 
describes physics midway between the classical regime covered by the Einstein 
equation (2.9) and a full-fledged, but still elusive, quantum gravity. Namely, in the 
semiclassical Einstein equation one takes into account that the ‘matter’ content of the 
universe is fundamentally of quantum nature as described by quantum field theory 
on curved spacetimes, whereas the background structure which is the spacetime is 
treated on a classical level and is not separately quantized. 

On the right-hand side one usually considers only Hadamard states. The reason for 
restricting to Hadamard states is that only for Hadamard states the higher moments 

u(:T ab (x): :T ab (x):) etc. 

can be defined. This is due to the fact that the n-point distributions of a state are 
distributions and thus they cannot simply be multiplied ( cf. Sect. 3.5.7). Since the 
two-point distributions of Hadamard states satisfy the microlocal spectrum condition, 
their wavefront set is contained inside a convex cone in T*(M xM) and hence powers 
of the n-point distribution are well-defined distributions. This will be discussed in 
more detail in Chap. 9. 

7.2 The stress-energy tensor 

While the left-hand side of the semiclassical Einstein equation remains unchanged 
with respect to the ordinary Einstein equation, the right-hand side changes quite 
dramatically. Namely, the classical stress-energy tensor T ab is replaced by a the 
expectation value of a quantum observable :T ab : in a certain state co. For this 
expression to be mathematically consistent, we need to require the conservation of 
the quantum stress-energy tensor, i.e., V a : T ab : =0. 

Remember that we chose units such that 8 tiG = c = 1. 
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7.2.1 The stress-energy tensor of the Klein-Gordon field 

We do not aim at discussing the semiclassical Einstein equation in all possible gener¬ 
ality. Instead we restrict our discussion to the semiclassical Einstein equation sourced 
by a scalar field. The classical stress-energy tensor of a Klein-Gordon field <p with 
equation of motion 

Pip = (□ + fR + m 2 )ip = 0 


may be written as [122] 

Tab = |v a V b cp 2 + \g ab U^ 2 - <pV a V b tp + |g ai) g cd ipV c V d ip 

24 2 (7.2) 

+ ?(G afe - V a V b - g ab D)ip 2 - |g ab m 2 ip 2 . 

It may be obtained by varying the classical action of the scalar field with respect 
to the metric [216, App. E], The quantum stress-energy tensor is obtained from the 
classical expression (7.2) by replacing products of classical fields by Wick products of 
quantum fields, t.e., 

■Tab- = |V Q V b :ip 2 : + |g ab D:ip 2 : - :ipV a V b ip: + |g a bg cd :<P V c V d <p: 

4 (7.3) 

+ £(G a& - V a V b -g ab U):(p 2 : - \g ah m 2 :Cp 2 \. 

This expression is not obviously conserved as 

V a :T ab : = -:(V b ip)(Pip): 

is not necessarily vanishing even if the Wick square was a solution of the equations of 
motion. Nevertheless, either by a judicious choice in the renormalization freedom of 
:cp 2 : and :ipV a V b cp: [122] or, equivalently, by a redefinition of the quantum stress- 
energy tensor [154], a conserved quantum stress-energy tensor can be found; here 
we follow the approach of Hollands and Wald. While the renormalization freedom 
can be used to find a conserved stress-energy tensor, it is not possible to impose the 
equations of motions on a locally covariant normal ordering prescription [122], 

7.2.2 Renormalization of the stress-energy tensor 

The renormalization freedom of :<p 2 : and :(p V a V b ip: is spanned by m 2 ,R and 
gab^ 4 , g ab m 2 R, m 2 R ab , V a V b R, g ab DR, □ R ab , g ab R 2 , 

KR ab > R ac RC b> gab R cd RCd > RLdR cadb> gab R cdef Rtde f ■ 

We have to split this renormalization freedom into two classes: (a) combinations 
of terms that are conserved and represent a true renormalization freedom, and (b) 
combinations of terms that are not conserved and need to be fixed to produce a 
conserved :T ab :. 

Denote by I ab and J ab the two conserved curvature tensors of derivative order 4: 

lab = 2 RRab ~ 2 V a V b R - |g ab ( R 2 + 4QR) , 

Jab = 2 RCdR cadb ~'^a'^b R ~ 0 R ab ~ |gab + Off) . 

The following is often stated in the form of a conjecture (e.g., in [219]): 
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Proposition 7.1. I ab and J ab span the whole space of conserved fourth order local 
curvature tensors. 

Proof. It is an easy task to confirm this statement by a direct computation along the 
lines of [68] : Taking the linear span of all fourth order curvature tensors 

'Va'^bR’ SabOR, i3R ab , g a bR 2 > RR a b> R ac RC b> 
g ab R cd R Cd , R Cd R cadb , gabRcdefR Cdef , 

one can show that any covariantly conserved combination C ab must be of the form 

Cab = «iV a V b R - a 2 g ab DR + 2(a x + a 2 )DR ab - |(ai + 2 a 2 )g ab R 2 

+ ( a l + 2ci2 )RRab + ( a l + a 2)gabR-cdR id ~ 4(a x + Ct 2 )R cd R cadb , 

i.e., one obtains (for general metrics) a two-dimensional solution space. For a 1 = 
—2, a 2 = 2 and a x = —1 ,a 2 = 1/2 we recover the tensors I ab and J ab , respec¬ 
tively. □ 

Remark 7.2. In conformally flat spacetimes (e.g., FLRW spacetimes) the Weyl tensor 
vanishes and thus the solution space reduces to one dimension as the two tensors become 
proportional: I ab = 3 J ab . On the level of traces this proportionality holds for all metrics, 
namely, I a a = 3 J a a = —6QR. 

We therefore find that the conserved renormalization freedom of : T ab : is spanned 
by m 4 g ab , m 2 G ab , I ab , J ab . The remaining terms renormalization parameters are 
fixed by the requirement of :T ab : to be conserved. 

7.2.3 Point-splitting regularization of the stress-energy tensor 

Up to the renormalization freedom, a normal ordering prescription for the stress- 
energy tensor is given by the Hadamard point-splitting method. Given two linear (pos¬ 
sibly tensorial) differential operators 9) 1; 3) 2 , the Hadamard point-splitting method 
yields the expectation value of f < 3> 1 (p)(fb 2 (p): by seperating points, regularizing and 
then taking the coincidence limit, that is 

a)(:(2S 1 ^)(9i2^) : ) = lim (m 2 (x, x) - W{x,xf) = 

x'—*x 

where acts on x ' anc l is (implicitly) parallel transported during the limit x' —* x. 

In the Hadamard point-splitting approach the stress-energy tensor in a state to of 
sufficient regularity is thus calculated as 

m(: T ab :) = (fl ab [w] + ST 6 cd [V c V d w]) + [ Vl ]g ab 

8u z 4tt z 

+ C 1 m4 gab + c 2 m2 C ab + C 3 I ab + C 4 J a b, 

where 3~ ab and 3~ ab cd are the differential operators acting, respectively, on :(p 2 : and 
:(pV a V b Ip: in (7.3): 

%b = l^a^b + \g ab O+ ?(G a b - V a V & - g ab \3) - \g ab m 2 , 

cr cd j_ jrc <rd , 1 _ „cd 

&ab ~ ~ 5 a 5 b + 2 Sabg ■ 
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Furthermore, c ; are dimensionless (renormalization) constants fixed once for all 
spacetimes 2 * and the addition of the Hadamard coefficient [v-J (see (6.6) for an 
explicit expression) makes the quantum stress-energy tensor conserved because 

lim = --\vjvj], 

x'^x 4tc z 

Observe that c 1 m 4 g ab can be interpreted as a renormalization of the cosmological 
constant and c 2 m 2 G ab corresponds to a renormalization of Newton’s gravitational 
constant G; the remaining two terms have no classical interpretation. 

7.2.4 Trace of the stress-energy tensor 

Because of its simple form, a first step towards analyzing the stress-energy tensor of a 
scalar field is often the study its trace, which is given by 

:T:=g ab :T ab : = -m 2 :if 2 : +3 {\-^)U:(p 2 -. - :(pV(p:. (7.4) 

It follows that the trace of the stress-energy tensor is calculated via point-splitting as 

m(:r:) = -fm 2 -3(^-^)nl —^[w] -I- 

V ^ w i 8 t 2 4u 2 1 (7.5) 

+ Ac x m 4 — c 2 m 2 R - (6c 3 + 2c 4 )DR, 
where c ; are the same constants as above and we used 

lim P^Oxr,*') = [vi]. 

4tc z 

Equations (7.4) and (7.5) clearly show the so-called trace anomaly [219]. Namely, 
because the normally ordered quantum field does not satisfy the equations of motion, 
the massless, conformally coupled scalar field (m = 0 and £ = 1/6) has non-vanishing 
trace of the stress-energy tensor although it is conformally invariant. It is not possible 
to remove the trace anomaly by a judicious choice of the renormalization constants 
because [vj] is not a polynomial of m 4 , m 2 R and DU. The trace anomaly is a distinct 
feature of the quantum theory and does not appear in a classical theory because the 
classical fields are solutions of the equation of motion. 

7.3 The semiclassical Friedmann equations 

On FLRW spacetimes (M, g) the classical Einstein equation (2.9) simplifies signif¬ 
icantly to the first and second Friedmann equation (2.18) and (2.19). Since the 
left-hand side remains unchanged when crossing over to the semiclassical Einstein 
equation, also the semiclassical equations must simplify in an analogue way for every 
state that satisfies the equation. Whence one obtains the semiclassical Friedmann 
equations 

6H 2 +R = 2co(\p:) + 2A, (7.6a) 

6(H + H 2 ) =-m(:p: + 3 :p:) + 271 = —m(: T: + 2:p:) + 2A, (7.6b) 

2 c 1 ,c 2 are due to the renormalization of :(p 2 : and c 3 ,c 4 correspond to the renormalization freedom 

of fyV a V h <fy 
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where the quantum energy-density : p : and the quantum pressure :p : are constructed 
out of the quantum stress-energy tensor just like their classical analogues are obtained 
from the classical stress-energy tensor. Henceforth we will restrict again to flat 
FLRW spacetimes but similar statements can also be made in the case of elliptic and 
hyperbolic spatial sections. 

States that satisfy the semiclassical Einstein equation need to respect the symme¬ 
tries of the spacetime. Therefore, any candidate state for a solution of the semiclassical 
Friedmann equations must be homogeneous and isotropic. That is, under reasonable 
assumptions, it must be a state of the form discussed in Sect. 6.2.2. Important exam¬ 
ples of homogeneous and isotropic states are the adiabatic states (Sect. 6.2.3) and 
the states of low energy (Sect. 6.2.5). 


7.3.1 Semiclassical Friedmann equations for the scalar field 

For the scalar field, the energy density and pressure are obtained from (7.3) and they 
read 

:p: = (| -O a+3 ^H 2 + \m 2 y.(p 2 : - )(p:, (7.7) 

3:p: = ((|-05 t 2 + (i-2?)n-?(6H + 9H 2 )-|m 2 ):cp 2 : - :(p (3 2 + §□) ip: 

with respect to cosmological time t. The expressions for conformal time t are given by 
the replacement d t >-» a~ 1 d T . A short calculation shows that the difference 3 :p: — :p : 
agrees with (7.4). 

The expectation values of :p: and :p: in a state to can again be calculated via 
Hadamard point-splitting. For the energy density this approach yields: 

w(:p:) = 9‘[w] - lim [(d 2 - |n)w] - -^K] 

x'^x z 4tt (7.8) 

- qm 4 + 3 c 2 m 2 H 2 - 6(3c 3 + c 4 )(2HH - H 2 + 6 H 2 H), 
where we have used the differential operator 

^=(i-?)3 2 -(|-On+3 m 2 + \m 2 

and the renormalization constants c t are again the same as in Sects. 7.2.3 and 7.2.4. 
The coincidence limit of the Hadamard coefficient v 1 on FLRW spacetimes can be 
obtain from (6.6): 

2 [vj] = im 4 -3(i-?)(H + 2H 2 )m 2 + 9(i-?) 2 (H 2 + 4HH 2 + 4H 4 ) 

(7.9) 

_X ( HH 2 + H 4 ) + i (!-?)□*. 

The point-split expression for co( :p :) will not concern us here and is left as an exercise 
to the reader. 

The next step is to replace the Hadamard point-splitting prescription with some¬ 
thing a that is slightly more useful under the given circumstances. 
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7.3.2 Adiabatic regularization 

An effective means of regularizing in momentum space is provided by the adiabatic 
regularization [50, 173], which is essentially equivalent to the Hadamard point¬ 
splitting regularization discussed above. In the adiabatic regularization prescription 
one subtracts from a homogeneous and isotropic state to the bidistribution specified 
by the adiabatic modes Wf^ of sufficiently high order n. The usefulness of this 
prescription lies in the fact that the bidistributions constructed out of the adiabatic 
modes satisfy the microlocal spectrum condition in the Sobolev sense up to a certain 
order. Consequently, the two regularization prescriptions can only differ by local 
curvature tensors because both the Hadamard parametrix and the adiabatic modes 
are constructed from the local geometric structure of the spacetime. 

It is therefore necessary to find the difference between point-splitting and adia¬ 
batic regularization 

lim a [ne n {x,X ')- * — - f w[ m) (T)wf (tO at) (7.10) 

y-x V (27t)- i a(T)a(T / ) J r3 J 

for various (bi) differential operators 9J) on C°°(M x M) and the minimal orders n, m 
depending on the order of 26. Note that we omitted the necessary e-regularization in 
the integrand. 

It is helpful to note that the (truncated) Hadamard parametrix on FLRW is spatially 
isotropic and homogeneous and therefore t' , 5 c') = 7<?(t, t', |jc — x'l). This 

fact can simplify some computations because the coincidence limit can be taken in 
two steps: first one takes the limit onto the equal time surface t = t' and then the 
spatial coincidence limit 5c' —* 5c. Making efficient use of the equation of motion, this 
has the advantage that we can replace any higher than first order time derivative 
in 26 by a spatial derivative and one needs to calculate the temporal coincidence limit 
only with the differential operators d T , <91 and d d'. A proof of this statement can be 
found in [196, Chap. 5], 

The computation (7.10) can be done in a general and efficent manner with a 
computer algebra system by combining the method of Avramidi to calculate Hadamard 
coefficients (Sect. 1.4.4), the coordinate expansion of the world function (Sect. 1.4.5) 
and analytic Fourier transformation. See also [83] for a related approach or [196] 
for a different method that makes more efficient use of the symmetries of FLRW 
spacetimes. 

The difference between point-splitting and adiabatic regularization for the Wick 
square can be calculated in this way as 


lim 

x'—>x 



x')- 


1 

(2tt) 3 a 2 





R 

288tt 2 ’ 


(7.11) 


where we used the potential B = m 2 + (£ — |)R and y denotes Euler’s constant. 

The exact form of the mode subtraction performed in the adiabatic regularization 
is inessential as long as it has the right k-asymptotics (c/ [196, Chap. 5]). For example, 
instead of subtracting the adiabatic modes of order zero in (7.11), one can perform 
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the following subtraction (c/ [178, 3] and Sect. 8.1.4): 



g(t?)a(q) 2 

16 , it 2 a 2 


(2 , n) 3 a 2 
B 


cor 




2m fc (i7) 4 m fc (rj) 3 


Mx-x') d £ 




16u 2 


2y + In 


A 2 a(q) 2 B(q)^ 


2a 2 J + 288 ti 2 ’ 


jR 


(7.12) 


where rj is an arbitrary instant in conformal time. But this realization is much more 
important once one attempts to calculate the adiabatic subtraction with several 
derivatives such as the energy density, where higher order adiabatic modes need to be 
subtracted. Here one generically finds that the adiabatic regularization involves terms 
with higher derivatives of the metric that do not become singular in the coincidence 
limit, i.e., terms which do not need to be subtracted to achieve the regularization. 

To find an adiabatic subtraction for other components of the stress-energy tensor 
one can follow the approach of Bunch [50] and take the second order adiabatic 
modes but discard all (non-singular) terms involving higher than fourth order time- 
derivatives of the metric. Comparing this subtraction scheme with the Hadamard 
point-splitting regularization for the energy density, we find (see also [114]) 


lim 

x'—>x 


a(T)a(T / )7if 1 (x,x / ) ■ 


(2Tt) 3 a 2 


d Hx- 




V2 a 2 

r 2 t j lt2 


4 fc 


m 4 a 2 + 12(£ - ±)m 2 a 2 H 2 + 36(^ - ±) 2 (6H 2 H - H 2 + 2 HH) 

16 k 3 


4tt 


1 H 4 r a 2 ^ 

Kl - (2-2 r + ln— 


m 


960tt 2 


2A 2 J 64tt 2 


1^2 tt2t 


( f hi a 2 ^\m 2 H 2 3tt-- 6 YH 2 R 

+ I 1 + 18(£- l)(2-2r + \n- '' 1 6 


2A 2 7 J 96n 2 


8n 2 


+ 


K- 


a 

[2 — 2 y + In 


17280tt 2 288tt 2 32tt 2 


—) 
2X 2 ) 


l 0 

J o > 


d fc 


(7.13) 


where the (bi) differential operator 


3 

2a 4 2S p = a 2 ( m 2 + (1 — 6£)H 2 ) Kl id + 2(6£ — 1 )aHd T IEI id + d r IEI ^ d x i Kl d x i 

i = 1 

can be derived from (7.7). In the case of conformal coupling it is in fact sufficient to 
work with adiabatic modes of order zero for this computation and many of the terms 
in the above formula drop out. 

Observe that both the point-splitting regularization with the truncated Hadamard 
parametrix and the adiabatic regularization do not depend on arbitrarily high deriva¬ 
tives in the metric. Consequently, it is possible to perform both regularization schemes 
in non-smooth spacetimes. This will be essential in Chap. 8. 
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Solutions of the semiclassical Einstein eq. 

Introduction 

If one wants to attach any physical meaning to the semiclassical Einstein equa¬ 
tion (7.1), it is necessary that solutions of this equation exist and that it possesses a 
well-posed initial value problem. It is not difficult to show that solutions do indeed ex¬ 
ist at least in two very special scenarios: Minkowski spacetime and de Sitter spacetime. 
In both cases the semiclassical Einstein equation (or, alternatively, the semiclassical 
Friedmann equations) can be solved for a specific choice of the renormalization 
constants. 

Solutions of the semiclassical Friedmann equations were investigated already 
numerically by Anderson in a series of four articles beginning with the massless 
conformally coupled scalar field [12, 13] and later also considering the massive field 
[14, 15]. Anderson discovered a complex landscape of solutions depending on the 
choice of renormalization constants and studied in particular solutions which show 
an asymptotically classical behaviour at late times. 

More recently, Pinamonti discussed the local existence of solutions to the semi¬ 
classical Friedmann equations in so-called null Big Bang (NBB) spacetimes [178], 
where initial values are specified on the initial lightlike singularity. 

In [3] the author and Pinamonti proved for the first time the existence of global 
solutions to the semiclassical Einstein equation coupled to a massive, conformally 
coupled scalar field in ‘non-trivial’ spacetimes. More precisely, it was shown that 
the semiclassical Friedmann equations can solved simultaneously for the spacetime 
metric (i.e., the scale factor or the Hubble function) and a quantum state from initial 
values given at some Cauchy surface. This was achieved by showing existence and 
uniqueness of local solutions for given initial values and subsequently extending local 
solutions to a maximal solution that cannot be extended any further because it exists 
either eternally or reaches a singularity. In this chapter, a slightly updated version of 
the results of [3] will be presented and complemented with recent numerical results. 

In more generality, solving the semiclassical Einstein equations for a given quan¬ 
tum field means the following: 

Given initial values for a spacetime metric and a quantum state prescribed on a three- 
dimensional Riemannian manifold (27, fr), do there exist a globally hyperbolic manifold 
(M, g) of which (27, h ) is a Cauchy surface and a state to (preferably a Hadamard state) 
such that the semiclassical Einstein equation (7.1) is fulfilled? 

In a concrete case this problem can be tackled by selecting a class of globally 
hyperbolic spacetimes that are foliated by the same topological Cauchy surface and 
a functional that associates to each spacetime in this class a unique states. For 
this approach to succeed, it is expected that the mentioned functional must satisfy 
some minimal regularity conditions with respect to the metric, e.g., continuous 
differentiability. 
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8.1 Preliminaries 


8.1.1 The traced stress-energy tensor 

Recall from Sect. 7.2.4 that the expectation value of the traced stress-energy tensor 
for a massive, conformally coupled scalar field reads 

<d(:T:) = -m 2 co{:(p 2 \ ) -1-~ [v^ - (6c 3 + 2 c 4 )DR (8.1) 

471 

m 2 1 a n 

= + —r[ v t] +4c 1 m 4 -c 2 m 2 R- (6c 3 + 2c 4 )IHR, 

8ti z 4n z 

where the Hadamard coefficient [v 3 ] is obtained from (7.9) with E, = 1/6 as 


H 2 . . 1 nr 

[vil =-(H + H 2 )- DR 4-. 

L 1J 60 V 2 720 8 


Working with the traced stress-energy tensor : T : simplifies calculations considerably 
compared to the energy density :p: given by (7.8). 

In order to find solutions of the semiclassical Friedmann equation with the 
methods discussed here, it is necessary to fix the renormalization constants according 
to the following rules: 

We will choose c 3 , c 4 in such a way as to cancel higher order derivatives of the 
metric coming from [v 3 ]. Following [218] and [217, Chap. 4.6], this is necessary 
because we want to have a well-posed initial value problem for a second-order 
differential equation. Removing the DR term might not be suitable for describing the 
physics close to the initial Big Bang singularity. In the Starobinsky model this term is 
the single term which is considered to drive a phase of rapid expansion close to the 
Big Bang, see the original paper of Starobinsky [201], its further development [141] 
and also [112, 114] for a recent analysis. However, this is surely suitable to describe 
the physics in the regime where DH <C H 4 . 

Furthermore, remember that changing c 3 corresponds to a renormalization of the 
cosmological constant A, whereas a change of c 2 corresponds to a renormalization of 
the Newton constant G ( cf. Sect. 7.2.3). For this reason we choose c x in such a way 
that no contribution proportional to m 4 is present in a>(: T :) and we set c 2 in order to 
cancel the terms proportional to m 2 R in a>(:T:). All in all, we fix the renormalization 
constants as 


4q = - 


1 

32u 2 ’ 


1 

Co =-and 6c 3 + 2c 4 = 

2 288tt 2 4 4 


1 

2880u 2 ' 


8.1.2 The semiclassical Friedmann equations 

We can rewrite the semiclassical Friedmann equations to make use of the simplicity 
of the traced stress-energy tensor for the conformally coupled scalar field: Adding 
the equations (7.6) (for flat FLRW spacetimes) yields 


—6(H + 2H 2 ) = o(: T :) — 4A. 


(8.2) 
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Since :T: = 3:p: — : p:, this equation is equivalent to (7.6) if we also specify an 
initial value p 0 = o>( : p: )(t 0 ) for the expectation value of the energy density at a 
time t 0 : 

3H 2 = 3H(t 0 ) 2 = Po + A. (8.3) 

We call (8.3) the constraint equation because it relates the initial value H 0 for the 
spacetime geometry with the initial value p 0 for its matter content; these values 
cannot be fixed independently. 

Inserting (8.1) into (8.2) and solving for H we thus find 


H = 


(H 4 - 2 H 2 c H 2 - y m 4 + 240 u 2 (m 2 m( : (p 2 :) + 4/1)), 


H 2 - H 2 


(8.4) 


which can integrated in conformal time to give 

rT a(r,) 


H(t) = H 0 + 


(H(rj) 4 - 2H c 2 H(r?) 2 - fm 4 


t 0 — H(r)) 2 


(8.5) 


+ 240Tt 2 (m 2 co(: (p 2 :)(rj) + 4/1)) dq, 


where H 2 = 1440u 2 /(8tiG) = 180tt/G. This integral equation will be our principal 
tool to solve the semiclassical Einstein equation. 


8.1.3 A choice of states 


As discussed in the introduction, a possible approach to solving the semiclassical 
Einstein equation is to select a class of candidate spacetimes and then for each of these 
spacetimes a unique state. Here we restrict ourselves to the semiclassical Friedmann 
equations as given by (8.2) and (8.3), viz., the candidate spacetimes are flat FLRW 
spacetimes. It remains to find a functional that associates to each flat FLRW spacetime 
a suitable state. 

It would be desirable to associate to each spacetime a Hadamard state. In the 
literature there are a few concrete examples of such states but unfortunately none 
of them are suitable for our purposes. On FLRW spacetimes there is the notable 
construction of states of low energy discussed in Sect. 6.2.5, which are also Hadamard. 
But the employed construction is based on a smearing of the modes with respect to 
an extended function of time and a priori we do not even know if a solution of (8.5) 
exists in the future of the initial Cauchy surface. The holographic constructions of 
Hadamard states, discussed in Sect. 6.3, requires that the spacetime has certain 
asymptotic properties which are not under control for generic FLRW spacetimes. 

Moreover, for technical reasons to be discussed later, we also have to consider 
spacetimes with C 1 metrics. But on spacetimes with non-smooth metrics Hadamard 
states cannot exist. Instead we will use the construction of adiabatic states of order 
zero as presented in Sect. 6.2.4, which is also applicable to spacetimes with low 
regularity. The price we have to pay for working with non-Hadamard states is that 
the solutions of (8.5) are not be smooth spacetimes. 

We recall, that the states constructed in Sect. 6.2.4 are of the form 


co 2 (x,x') = 


(2u) 3 a(T)a(T / ) 


XkMXkt t )e* ' 


A ji-(x-x') 


dk 


( 8 . 6 ) 
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with modes Xk = ^(t) given by 







r T 

J T 0 


sin (fc 0 (rj - t)) 




where k^ = k 2 + a(ir 0 ) 2 m 2 and V(t) = m 2 (a(T) 2 — a(T 0 ) 2 ). Note that Xk and a) 2 can 
be defined in this way even if the scale factor a is only C 1 . This may be confirmed by 
taking a closer look at Prop. 6.2. 


8.1.4 Adiabatic regularization of the Wick square 

The integral equation (8.5) does not contain the two-point function but instead only 
its smooth part w in the coincidence limit, i.e., Hadamard point-splitting has to 
applied to (8.6). 

The equation (8.5) that we seek to solve contains the Wick square :<p 2 : in a 
state to and thus (on a smooth spacetime) we would need to compute 

to(:<p 2 :)= lim (co 2 (x,x')— — ^c^ 4 + c 2 m 2 R. (8.7) 

x'—>x 

Since we are on a FLRW spacetime, we can use the method of adiabatic regularization 
instead (Sect. 7.3.2) to perform an equivalent subtraction on the level of modes. The 
difference of the two approaches is given in (7.11) or, equivalently, (7.12). It is useful 
to show directly that this regularization prescription indeed regularizes the two-point 
distribution (8.6): 


Proposition 8.1. The regularized two-point distribution 


co 2 (r,x — x 0 — lim 


e — * 0 + (2tt) J a(T) 2 



V(t)~ 
< . 


e i Hx-x') e -sk d % 


with to 2 given by ( 8 . 6 ), converges in the coinciding point limit for all continuously 
differentiable scale factors a. 


Proof. We have to show that 

d k (8.8) 


oo n 

n=0 1=0 

in terms of the order n. Inserting this expansion into (8.8), we can prove the statement 
order by order: 


lim 

£—» 0 + 




e~ sk d k = 


\Xk\ 2 ~ 


2 kn 


V 


is finite. To this end we expand the product \Xk\ 2 with Xk = 2 n X k as 


Oth order. Since x k X k = C2fc 0 ) 1 , the first term in the subtraction exactly cancels 
the zeroth order term \x k \ 2 i n (8.8). 
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1 st order. Using an integration by parts, we can rewrite the first order terms as 


(*°*fc + *k*°XT)= 7a 

K 0 


sin (k 0 (p — t)) cos (k 0 (p — t)) V(p) dp 


TO 


or- 2 


sin 


(2fc 0 (p-T))V(p)dp 


T 0 

V(t) 1 
4fc 0 3 4fc 0 3 , 




cos (2k 0 (p — T))V / (p)dp. 


T 0 


While the first summand V(t)(4/Cq) 1 in the last line is exactly cancelled by the 
second term in the subtraction in (8.8), the second summand yields 


r t 




cos (2k 0 (p — t))V / (tj) dp e £fc dk 


r 00 7.2 


= Tt 


= Tt 


= Tt 


r T 


k 3 \ 

o K o 


cos (2fc 0 (p — t))V / (p) dp le ek dk 


To 


r 00 

fc oV 1_a2fc o 


C T 


a 0 m 


cos (2fc 0 (p — T))V / (p)dp e £fc dk 0 


T 0 


/■ 00 


np) 


k 0 1 cos (2fc 0 (p - t)) e efc dfc 0 ] dp—R( t) (8.9) 


a 0 m 


for e > 0. Here P is a finite remainder term since it contains terms in the k 0 - 
integration which decay at least like k^ 3 . Notice that, in the last equation of the 
previous formula, thanks to the positivity of e we have switched the order in which 
the k 0 - and p-integration are taken. We would like to show that the weak limit 
e —* 0 + can be taken before the p-integration in (8.9). 

To this end it remains to be shown that the k 0 -integral in (8.9) converges in the 
limit e —» 0 + to an integrable function in [t 0 , t]. First, note that the exponential 
integral 


E 1 (x) = r(0,x) = 


r°° e -xt 


r i 


dt = 


x — ln(l — s) 


■ ds 


( 8 . 10 ) 


converges for x ^ 0, Rex > 0. To show the identity, we used the substitution 

t = —x _1 ln(l — s) + 1 

involving a subtle but inconsequential change of the integration contour in the 
complex plane if x is complex. Then we easily see that 


f 00 


lim 

£^0 + 


k -l e ±2ik 0 ^-^-ek 0 ^ = ^ 2iaom ( T? _ T )) ( 8 .11) 


a 0 m 


converges for p^t. This result is related with (8.9) via 


lim 

£—>0 + 


a 0 m 


k Q 1 cos (2k 0 (p — t)) e ek dk 0 = lim 

e—»0 + 


k 0 1 cos (2k 0 (p - t)) e £k ° dk 0 , 


a 0 m 
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where we have used the boundedness of k — ( k 2 — a 2 m 2 ) 1 ^ 2 and (8.11). Finally, a 
bound sufficient to see the p-integrability of the fc 0 -integral in (8.9) can be obtained 
from the identity in (8.10), namely, 


|£i(ix)| = 


(ix — ln(l —s)) 1 ds 


< 


(x 2 + s 2 ) 1//2 ds = in 


1 + 


+ x" 


2nd order. For the second order we calculate 


(z° xl + xl xl + Xk I°)(t) 

1 rT 


k 3 

K o 


sin 


(k 0 (ri - t))V(p) 


t 0 


rv 


sin (k 0 (£ - rj))V(^)cos (fc 0 (? - t)) dE, 


T 0 


1 

+ 2 


/■T 


rz 


k 3 
K 0 J 


rv 


To 
r t 


9 k 4 
ZK o 


sin (k 0 (? - t))V(^)cos (k 0 (? - p)) d£ ) dp 

To 

sin(fc 0 (rj-T))V(rj) j 
sin(fc 0 (q-T))V(rj) 
cos (fc 0 (q — t))V(p) dp, 




sin (k 0 (2E, — r\ — t)) V(£)d£ dp 

^0 y 

r v 

cos (fc 0 (2£ - p - T))y / (^)d^ 


( 8 . 12 ) 


where we have used integration by parts in the last equality. It is easy to obtain a 
/c-uniform estimate for the integral above and thus the integrability of the second 
order follows from J r3 k^ 4 d k < 00 . 


Higher orders. For orders n > 2 it is sufficient to use the rough estimate from (6.20): 


1 


n =3 1=0 


1 “ 2 n / 1 f T , . 


dp 


4 

< — 


0 n =3 
fT 


(T-p)|v(p)|dp 


n—3 


^0 


0 W 


r T 


|v(p)| dp exp 2 (t — p) |v(p)| dp . (8.13) 


As above, the integrability of the higher orders follows from J r3 fc“ 4 d k < 00 . 

Note that none of the estimates above depends on higher derivatives of the 
scale factor. Therefore, combining these partial results, we see that the thesis holds 
true. □ 


It follows that we can consistently define the renormalized Wick square of the 
state given by (8.6) at conformal time t for every FLRW spacetime with C 1 scale 
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factor 


co(:(p 2 :) = 


(271 ) 3 a 2 


\xiX 


i 

2 k~ 0 


+ 


m 


(4ti) 2 


Qq 

a( t) 


y 

4fcf. 


+ 2 In 


dk 



(8.14) 


which coincides with (8.7) for smooth spacetimes. Moreover, we notice that, as a 
consequence of the previous proposition, it is possible to obtain global estimates for 
the renormalized Wick square: 

Proposition 8.2. The renormalized Wick square is bounded on every a' e C[t 0 , Tj] 
with a > 0 in [t 0 , Tj] for every t 1 and with a( t 0 ) = a 0 , namely, 


|m(:(p 2 :)(Ti)| < C f sup a, sup a', (tj - t 0 ), —- 

[ T oAli 


where C is a finite increasing function. 


Proof. The proof of this proposition and the explicit value of C, can be obtained 
combining (8.7) with (7.12) and then analyzing the adiabatic subtraction (8.8) order 
by order as in the proof of the preceding proposition. □ 


8.1.5 Adiabatic regularization of the energy density 

There is another nice feature about the states we have constructed above. Thanks 
to the conformal coupling of the scalar field with the curvature, the energy density 
computed in these states is finite even though these states are (on smooth spacetimes) 
only adiabatic states of order zero. This is a crucial feature which permits us to solve 
the constraint (8.3) as a first step towards solving the semiclassical Einstein equation. 

Proposition 8.3. The energy density p in the state co defined by (8.6) at the initial time 
t = t 0 is finite. 


Proof. Following [114], in order to show that p( t 0 ) is finite, we just need to show 
that the adiabatically regularized expression (see also (7.13) and the subsequent 
remark) 

r oo 

Ulx'f + (k 2 + m 2 a 2 f Xk \ 2 ) - (K°f + (k 2 + m 2 a 2 )|w® | 2 )) k 2 d k (8.15) 

Jo 


does not diverge at t = t 0 . Evaluating the expression (8.15) at t = t 0 gives 

r 00 a 2 [a') 2 
0 ( k 2 + m 2 aIf ! 2 


m 

~8~ 


k 2 dk < oo. 


□ 


Notice that the previous proposition only guarantees that the energy density is 
well-defined at the initial time. Nevertheless, the conservation equation for the 
stress-energy tensor permits to state that it is well-defined everywhere. 
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The expression (8.15) coincides with the energy density p of the system up to 
a conformal rescaling and up to the addition of some finite terms. Thus, since the 
energy density p is finite in the considered state, the constraint (8.3) holds, provided 
a suitable choice of H(t 0 ) and A is made. We stress that, if we do not want to alter 
A, the same result can be achieved adding classical radiation to the energy density of 
the universe in a suitable state. 

We would like to conclude this section with a remark. In adiabatic states of 
order zero the expectation values of local fields containing derivatives are usually 
ill-defined. Despite this, in the case of conformal coupling and for our choice of initial 
conditions (6.18), the energy density turns out to be well-defined. This is essentially 
due to the fact that in the massless conformally coupled case the adiabatic modes of 
order zero are solutions of the mode equation (6.10) and in that case the obtained 
state is the well known conformal vacuum. Hence, the adiabatically regularized 
energy density vanishes. In the massive case the states constructed above are not very 
different than the conformal vacuum and, in particular, the energy density remains 
finite under that perturbation. 


8.2 Local solutions 


Our aim is to show the existence and uniqueness of local solutions to the semiclassical 
Friedmann equation. In particular, according to the discussion in the introduction, 
we will analyze the uniqueness and existence of solutions of (8.5). Similar to the 
Picard-Lindelof theorem, we will use the Banach fixed-point theorem to achieve this 
goal. Some results on functional derivatives and the Banach fixed-point theorem are 
collected in Chap. 3, in particular Sect. 3.4. 

Solving (8.5) is equivalent to finding fixed-points of the functional F defined by 


F(H)(t) = H 0 + 


= H n + 


a(rj) 


H 2 - H(r?) : 


-(H(rp) 4 -2H 2 H(rp) 2 -f 


m 


+ 2407t 2 (m 2 <D(:^ 2 :)(H)(rj) + 4A)) dtp 
r T 


/(H)(1)) dtp 


(8.16a) 


(8.16b) 


1^0 


Since co{\(p 2 \){H) is well-defined for continuous Hubble functions (see also (8.14)), 
we select for the Banach space of candidate Hubble functions H the space 1 C[t 0 , 
t 0 < Ti, equipped with the uniform norm 


imicDo.n] = imioo = sup ix(t)|. 

're[T 0 ,Ti] 

However, once t 0 and the initial condition a 0 = q(H)(t 0 ) > 0 are fixed, we find that 


a(H)( t) = a 0 




l-a 0 


H(rj)dtj 


(8.17) 


1 Until fixed, we take both t 0 and t 1 as variable and thus consider a family of Banach spaces. 
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as a functional of H, is not continuous on C[t 0 , Tj]. But we can find an open subset 

^[t 0) Ti] = {HeC[To.Tx] | ||H|| c[TojTi] < min{a(To) _1 (Ti - T 0 r\H c }} (8.18) 

on which a and thus also V = m 2 {a 2 — afy depend smoothly on H. Indeed, we can 
show the following: 

Proposition 8.4. The functional 

/(H) = H 2^2 ( H4 ~ 2H c h2 - f m4 + 240tt 2 (m 2 m(:y 2 :)(H) + 471)) (8.19) 

is continuously differentiable on Tl [t 0 , t x ] for arbitrary but fixed t 0 , and a 0 = a( t 0 ). 


Proof. Given (8.7), (7.12), Prop. 8.1 and Prop. 8.2, it is enough to show that a(H) 
and (H 2 — H 2 )” 1 are bounded and that o>(:(/ 2 :)(H)(t 0 ) is continuously differentiable. 
The former is assured by the condition ||H|| C [ T T ] < min{ag 1 (T 1 — t 0 ) _ 1 ,H c } in the 
definition of Tl [t 0 , t x ]. For the latter it remains to be shown that the renormalized 
Wick square (8.14) is continuously differentiable on TL\ t 0 , t J: 

We start by calculating the functional derivative of the scale factor 




da(H; 5 H)(t) = a(H)(r) 2 


5H(p) dp. 


The functional derivatives for a~ 2 , In a, V and V' follow easily. In particular we 
note that all these functions are continuously differentiable on Tl [t 0 ,T!] because 
integration is a continuous operation and a depends smoothly on H in Tl[ T 0 , t{\. 
Therefore it suffices to analyze the differentiability of the integral (8.8) appearing 
in the regularized two-point distribution. Moreover, within y k only the potential 
V is (smoothly on Tl\ t 0 ,tJ) dependent on H, thus simplifying the computations 
considerably. 2 Continuing with the regularized two-point distribution order by order 
as in Prop. 8.1, we have: 


1st order. Since 

1 f T 

= “73 cos(2k 0 (t?-T))dV / (H;5H)(p)dp, 

4/c o Jt 0 

we can proceed with the proof as in Prop. 8.1 with V 1 replaced by dV' and differen¬ 
tiability follows. 

2nd order. As above, this part of the proof can be shown by replacing occurrences of 
V and V' in (8.12) of Prop. 8.1 with dV and dV' respectively. 

2 If we were to work in cosmological time as in [178], we would also have to consider the functional 
dependence of conformal time on the scale factor. 
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Higher orders. For orders n > 2 we can again use an estimate similar to (6.20) to 
obtain a result analogous to (8.13): 


d 22**** 1 


^n =3 1=0 


\ 

4 f f 

J(H;5H)(t) 

K 0 


r T 


|dV(H;5H)(rj)|drjJ 
|v(rj)| d r) I exp 2 (t — rj) |v(i 7 )| dp I . 


^0 


^0 


In this way we can conclude the proof of the present proposition. 


□ 


We can now formulate the main theorem of this chapter: 

Theorem 8.5. Let (a 0 ,H 0 ),a 0 > 0, |H 0 | < H c , be some initial conditions fixed at t 0 
for (8.5). There is a non-empty interval [t 0 , Tj] and a closed subset U c C[t 0 , t x ] on 
which a unique solution to (8.5) exists. 

Proof. In Prop. 8.4 we showed that / is continuously differentiable on Tl [t 0 , t-J 
for any t x . Using Prop. 3.11, we can thus find a tj > t 0 and a closed subset 
U c Tl [t 0 , Tj] such that F(L7) c U. It then follows from Prop. 3.10 that F has a 
unique fixed point in U. □ 


Notice that the solution provided by the previous theorem is actually more regular, 
it is at least differentiable. Thus the corresponding spacetime is C 2 and has well- 
defined curvature tensors. The extra regularity is provided by (8.5) and can be 
easily seen when it is written in its differential form (8.4). It might be surprising 
that the solutions are not smooth, since the procedure to find the solution involves 
repeated integration, but because the chosen adiabatic state is only guaranteed to be 
continuous on every spacetime, H is only C 1 . Using Cor. 3.6, one can see that a more 
regular state immediately improves also the regularity of the solution. 


8.3 Global solutions 

In this section we would like to show that it is always possible to extend a ‘regular’ 
local solution up to the point where either H 2 becomes bigger than H 2 or a diverges. 0 
To this end we start giving a definition we shall use below. 

Definition 8.6. A continuous solution H* of (8.5) in the interval [t 0 , t x ] with initial 
conditions 

a(H*)( t 0 ) = a 0 , H*(t 0 ) 2 = H 2 = - (p( t 0 ) + A) 

will be called regular, if no singularity for either a, H* or H' is encountered in [t 0 , t {\. 
Namely, H* must satisfy the following conditions: 

2. a 0 JJ H*(r?) drj < 1 for every t in 


3 H 2 = H 2 corresponds to a singularity in the derivative of H in (8.4). 
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We remark that a local solution obtained from Thm. 8.5 is a regular solution. 
Henceforth, assume that we have a regular solution H, r as described in the definition. 
Notice that condition a) ensures that no singularity in H' is met in [t 0 , t 1 ]. Condition 
b), on the other hand, ensures that a does not diverge in the interval [t 0 , t 2 ]. 
Moreover, both a) and b) together imply that a is strictly positive, as can be seen 
from (8.17). 

We would like to prove that a regular solution can always be extended in C [t 0 , t 2 ] 
for a sufficiently small t 2 — t x > 0. To this end, let us again consider the set 


^I>i,t 2 ] = {He C[t 1; t 2 ] ||H|| c[Ti T2] <min{a 1 1 (T 2 -T 1 ) 1 ,H C }| 


defined in (8.18) and where dj = a(H*)(T 1 ) is the value assumed by the solution 
a(H*) at t 1 . Now we can give a proposition similar to Prop. 8.4, namely: 


Proposition 8.7. Let H* be a solution of (8.5) in C[t 0 , t x ] which is also regular. The 
functional /(H) of (8.19), when evaluated on regular extensions of H* in TL[ t 13 t 2 ], is 
continuously differentiable for arbitrary t 2 > t 1 . 


Proof. The proof of this proposition can be obtained exactly as the proof of Prop. 8.4. 
However, the estimates we have obtained in Prop. 8.2 and the proof of Prop. 8.4 
cannot be applied straightforwardly because the state to depends on the initial time t 0 
and the initial datum a 0 through the construction described in Sect. 6.2.4. Moreover, 
the estimates of Prop. 8.2 depend on the knowledge of a and a' on the whole interval 
[t 0 , t 2 ]. Luckily enough, we know that the solution H* is regular in [t 0 , Tj], while 
we know that the extension restricted to [t 15 t 2 ] is in the set TL[t 1 , t 2 ]; thus we just 
need to use the following estimates 


INIc[t 0 A 2 ] = maX {H a llc[T 1; T 2 ]’ INIc[T 0/ri ]}» 

ll a llcH 0 ,, 2 ]= m ^{ll a llr4r 2 ]’ll a llcLr 1 ]}’ 

= max { H a/ Ilc['r 1 ,'r 2 ]a ll“ / |lc[T 0 ,'r 1 ]}- 

With this in mind, we can again use Prop. 8.2 to control the boundedness of co(:<p 2 :). 
Then, making the replacements t 0 —* T l5 t 1 —* t 2 and a 0 —* aq at the appropriate 
places in Prop. 8.4, one can see that estimates are not substantially influenced and 
that thesis still holds for TL [tq, t 2 ]. □ 


Notice that it is always possible to fix t 2 such that o/ 1 (t 2 — iq ) -1 > H c , whereby 
TL [ti, t 2 ] becomes the set of all possible regular extensions of H* in [tq, t 2 ]. This 
guarantees that any extension in TL\t 1 , t 2 ] is the unique regular extension. 

We are now ready to state the main theorem of the present section which can be 
proven exactly as Thm. 8.5. 

Theorem 8.8. Consider a solution H*(t) in C[t 0 , tq] of (8.5). If the solution is regular 
in [t 0 , t^], as defined in Def 8.6, then it is possible to find a t 2 > t 1 such that, the 
solution H* can be extended uniquely to C[t 0 , t 2 ] and the solution is regular therein. 

Proof. Thanks to Prop. 8.7, / is continuously differentiable on all regular extensions 
of H* in TL[r lt t 2 ] for any t 2 such that a i V 2 - Ti) 1 > H c . With the remarks of 
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the proof of Prop. 8.7 we can use Prop. 8.2 to estimate the boundedness of a>(: (p 2 :) 
and apply Prop. 3.11 to find a t 2 > tj and a closed subset U c % [t 1s t 2 ] such that 
F(L7) c U. It then follows from Prop. 3.10 that F has a unique fixed point in U. □ 

We study now all possible solutions of (8.5) which are defined on intervals of the 
form [t 0 , t), which are regular on any closed interval contained in their domain and 
which enjoy the same initial values a 0 = a(T 0 ),H 0 = H(t 0 ). 

Proposition 8.9. A maximal solutions exists; it is unique and regular. 

Proof. Let S = {I a ,H a } aeA , with A c N some index set, be the set of all possible 
regular solutions H a with domain I a for the same initial values. By the existence 
of local solutions <S is not empty. We then take the union I = U aeA l a and define 
H(t) = H a ( t) for t e I a , which is a well-defined regular solution by Prop. 8.7. Since 
every I is a superset of every I a , H is the unique maximal regular solution. □ 

As for the solution provided by theorem 8.5, also the maximal solution obtained 
above correspond to a metric with C 2 regularity. 

8.4 Numerical solutions 

The first problem that one encounters when attempting to treat the semiclassical 
Einstein equation in a numerical fashion, is the construction of states. Here, the mode 
equation (6.10), which describes an oscillator with a time-dependent ‘resonance 
frequency’ a> 2 = k 2 + a 2 m 2 , has to be solved. Standard numerical solvers, like 
the Runge-Kutta method, rely on differentiation and their error scales like a (high- 
order) derivative of the solution. However, each derivative of an oscillating function 
increases the amplitude by a power of the frequency, thus ultimately leading to large 
errors for quickly oscillating differential equations after a short time span. This 
problem can be partially counteracted by choosing ever smaller step sizes in time, 
but eventually one will encounter a computational barrier. Another possibility is 
to look for a non-standard approach to solve the mode equation. Such methods 
replace differentiation with integration, but, since the numerical integration of highly 
oscillatory functions is also a non-trivial problem, this is still an active area of research, 
see for example [84, 131, 132, 143, 144]. 

One might say, that the large frequency behaviour of the modes is of no relevance 
when solving the semiclassical Einstein equation because it involves the state only 
after regularization, i.e., after the terms that contribute for large oj k have been 
subtracted. While this response is, to some degree, certainly true, numerical errors in 
the solution before the regularization and in the subtraction itself can still accumulate. 
Therefore, the issue of the high frequency modes and their regularization has to be 
carefully addressed in a numeric approach. 

Although the perturbative construction of the state used in this chapter (see also 
Sect. 6.2.4), the functional (8.5) and the use of the Banach fixed-point theorem in 
the proof of Thm. 8.5 were not developed with a numerical application in mind, there 
are reasons why they might be useful also for numerics: The mode solution are found 
recursively from (6.19), an integral equation which avoids the differentiation problem 
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discussed above. Moreover, as seen in Prop. 8.1, only the first two partial modes x k 
and are affected by the regularization in a well-understood way, so that also the 
numerical difficulty in the regularization can circumvented. Nevertheless, also this 
approach is not without its problems as it involves repeated numerical integration 
of oscillating functions and therefore it is very slow if naively implemented as a 
Riemann sum, because it requires small time steps. The other feature of the proof of 
existence that allows a translation to numerics is the use of the Banach fixed-point 
theorem. Namely, we can be assured that an iterated recursive application of (8.5) 
will converge to a solution, even though we do not know how quickly convergence 
occurs. 


8.5 Outlook 


In this chapter we have studied the backreaction of a quantum massive scalar field 
conformally coupled with gravity to cosmological spacetimes. We have given initial 
conditions at finite time t = t 0 and we have shown that a unique maximal solution 
exists. The maximal solution either lasts forever or until a spacetime singularity is 
reached. 

In order to obtain this result, we have used a state which looks as much as possible 
like the vacuum at the initial time. Notice that it is possible to choose other classes of 
states without significantly altering the results obtained in this chapter. In particular, 
if we restrict ourself to Gaussians pure state which are homogeneous and isotropic, 
their two-point function takes the form 


1 

^ 2 0>.y)= lim + 7^-TT 
e->o + (2tt ) d 



Zki'ty') e ik.(x-y) e -ek ^ 


where E, k are solutions of (6.10) which enjoy the Wronskian condition (6.11). These 
X k can then be written as a Bogoliubov transformation of the modes %k studied earlier 
in this chapter, namely, 

Zk =A(k)xk + B(k)J k 


for suitable functions A and B. Then, because of the constraint |A| 2 — |B| 2 = 1, the 
difference 


a)(:ip 2 :) — o)(:(p 2 :) = lim 


2 . 1 - 


o + (2tt) 3 a z 


(IB 


XkXk ' 


B-e{ABx k Xk ))e ek dk 


can be easily controlled employing (6.20) if |B| is sufficiently regular (e.g., if B(k) is 
in L 2 n L 1 ). 4 With this observation it is possible to obtain again all the estimates used 
in the proofs of Thms. 8.5 and 8.8. 

In the future, it would be desirable to study the semiclassical equations in more 
general cases, namely for more general fields, abandoning for example the conformal 
coupling, and for more general background geometries. The results presented here 
cannot straightforwardly be extended to fields which are not conformally coupled 
to curvature or to spacetimes that are not conformally flat because in that case 


4 A detailed analysis of this problem is present in [221]. 
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fourth order derivatives of the metric originating in the conformal anomaly cannot 
be cancelled by a judicious choice of renormalization parameters, i. e ., Wald’s fifth 
axiom [218] cannot be satisfied. To still solve the semiclassical Einstein equation 
with methods similar to those presented here, a deeper analysis of the states is 
required, in particular, one needs states of higher regularity. A preliminary study 
in this direction can be found in a paper of Eltzner and Gottschalk [83], where 
the semiclassical Einstein equation on a FLRW background with non-conformally 
coupled scalar field is discussed. The case of backgrounds which are only spherically 
symmetric is interesting from many perspectives. Its analysis could give new hints 
on the problem of semiclassical black hole evaporation and confirm the nice two- 
dimensional results obtained in [19]. Finally, the limit of validity of the employed 
equation needs to be carefully addressed in the future. 


Induced semiclassical fluctuations 


9 


Introduction 

As described in Chap. 7, in semiclassical Einstein gravity one equates a classical 
quantity, the Einstein tensor, with the expectation value of a quantum observable, 
the quantum stress-energy tensor, i. e ., a quantity with a probabilistic interpretation. 
Such a system could make sense only when the fluctuations of the quantum stress- 
energy tensor can be neglected. Unfortunately, as also noticed in [178], the variance 
of quantum unsmeared stress-energy tensor is always divergent even when proper 
regularization methods are considered. The situation is slightly better when a smeared 
stress-energy tensor is analyzed. In that way, however, the covariance of (7.1) gets 
lost. A possible way out is to allow for fluctuations also on the left-hand side of (7.1). 
This is the point of view we shall assume within this chapter, which is based on an 
article [4] in collaboration with Pinamonti. 

More precisely, we interpret the Einstein tensor as a stochastic field and equate 
its n-point distributions with the symmetrized n-point distributions of the quantum 
stress-energy tensor. As an application of this (toy) model, we analyze the metric 
fluctuations induced by a massive, conformally coupled scalar field via the (quantum) 
stress-energy tensor in the simplest non-trivial spacetime - de Sitter spacetime. We 
find that the potential in a Newtonianly perturbed FLRW spacetime has a almost 
scale-invariant power spectrum. 

These results encourage a comparison with the observation of anisotropies in 
the cosmic microwave background and their theoretical explanations. Anisotropies 
in the angular temperature distribution were predicted by Sachs and Wolfe [188] 
shortly after the discovery of the cosmic microwave background (CMB) by Penzias 
and Wilson [174]. In their famous paper they discuss what was later coined the 
Sachs-Wolfe effect : The redshift in the microwave radiation caused by fluctuations 
in the gravitational field and the corresponding matter density fluctuations. In the 
standard model of inflationary cosmology the fluctuations imprinted upon the CMB 
are seeded by quantum fluctuations during inflation [161, 162], see also the reviews 
in [76, 80], 

The usual computation of the power spectrum of the initial fluctuations produced 
by single-field inflation can be sketched as follows [26, 76, 80]: First, one introduces 
a (perturbed) classical scalar field ip + 5ip, the inflaton field, which is coupled to a 
(perturbed) expanding spacetime g + 5g. Then, taking the Einstein equation and the 
Klein-Gordon equation at first order in the perturbation variables, one constructs an 
equation of motion for the Mukhanov-Sasaki variable Q = 5(p + <p ] 4>, where <P is 
the Bardeen potential [25] and H the Hubble constant. Q is then quantized 1 (in the 
slow-roll approximation) and one chooses as the state of the associated quantum field 


*A recent discussion about the quantization of a such system can be found in [81]. 
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a Bunch-Davies-like state. Last, one evaluates the power spectrum Pq(/c) of Q, i.e., the 
Fourier-transformed two-point distribution of the quantum state, in the super-Hubble 
regime k <C aH and obtains an expression of the form 2 


PqW = 


Aq 

k 3 



(9.1) 


where Aq is the amplitude of the fluctuations, k 0 a pivot scale and n s the spectral 
index. Notice the factor of k~ 3 in (9.1) which gives the spectrum the ‘scale-invariant’ 
Harrison-Zel’dovich form if n s = 1. Depending on the details of model, n s < 1 and 
there is also a possibility for a scale dependence of n s - the ‘running’ of the spectral 
index n s = n s (k). 

This result can then be related to the power spectrum of the comoving curvature 
perturbation 31, which is proportional to Q, and can be compared with observational 
data. Assuming adiabatic and Gaussian initial perturbations, the WMAP collaboration 
finds n s = 0.9608±0.0080 (at k 0 = 0.002 Mpc -1 ) in a model without running spectral 
index and gravitational waves, excluding a scale-invariant spectrum at 5cr [121]. 
Furthermore, the data of WMAP and other experiments can be used to constrain the 
deviations from a pure Gaussian spectrum, the so called non-Gaussianities, that arise 
in some inflationary models [26, 33, 150], 

In [7-9, 170] concerns have been raised whether the calculation leading to (9.1) 
and similar calculations are correct: The authors argue that the two-point distribution 
of the curvature fluctuations has to be regularized and renormalized similarly to what 
is done in semiclassical gravity. As a result the power spectrum is changed sufficiently 
that previously observationally excluded inflation models become realistic again. 
On the contrary the authors of [77, 153] argue that the adiabatic regularization 
employed in [7-9, 170] is not appropriate for low momentum modes if evaluated 
at the Horizon crossing and irrelevant for these modes if evaluated at the end of 
inflation. 

A slightly different approach to the calculation of the power spectrum based on 
stochastic gravity can be found in [129, 184, 185]. In spirit similar to the approach 
presented in this chapter, the authors equate fluctuations of the stress-energy tensor 
with the correlation function of the Bardeen potential. In the super-Hubble regime 
they obtain an almost scale-invariant power spectrum. Moreover, they discuss the 
equivalence of their stochastic gravity approach with the usual approach of quantizing 
metric perturbations. 

Our approach here is strictly different from the standard one described above. 
Instead of quantizing a coupled system of linear inflaton and gravitational pertur¬ 
bations, we aim at extending the semiclassical Einstein equation to describe metric 
fluctuations via the fluctuations in the stress-energy tensor of a quantum field. 


9.1 Fluctuations of the Einstein tensor 

Consider now the Einstein tensor as a random field. Then we could imagine to equate 
the probability distribution of the Einstein tensor with the probability distribution of 


2 An alternative definition of the power spectrum is S^ffe) = (2tt) 2 fc 3 P Q (fc). 
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the stress-energy tensor. This suggestion, however, seems largely void without a pos¬ 
sibility of actually computing the probability distributions of the stress-energy tensor 
because, as discussed above, its moments of order larger than one are divergent. 

Instead we may approach this idea by equating the hierarchy of n-point distribu¬ 
tions of the Einstein tensor with that of the stress-energy tensor: 

(GabOt)) = ^(^abOi):)* (9.2a) 

(5G ab (xi)5G c / d /(x 2 )) = :5T c / d /(x 2 ): + :5T c , d /(x 2 ): 

(9.2b) 

and 


(5G m ) = co(Sym(:5T: m )), n> 1, (9.2c) 

where co is a Hadamard state and we defined 

<5G ab = G ab - {G ab ) an d :5T ab : = :T ab : - co(:T ab :). 

The symmetrization on the right-hand side is necessary because the classical quantity 
on left-hand side is invariant under permutation. 

We emphasize that we are equating singular objects in (9.2c). Having all the 
n-point distributions of the Einstein stochastic tensor, we can easily construct an 
equation for the moments of the smeared Einstein tensor which equals the moments 
of a smeared stress-energy tensor by smearing both sides of (9.2) with tensor products 
of a smooth compactly supported function. This smearing also automatically accounts 
for the symmetrization in (9.2). 

Furthermore we stress that equating moments, obtained smearing both side 
of (9.2), is not equivalent to equating probability distributions. Although it is also 
possible to arrive at a description in terms of moments when coming from a probability 
distribution, the inverse mapping is not necessarily well-defined. Successful attempts 
to construct a probability distribution for smeared stress-energy tensors can be found 
in [90, 91]. 

Consider now a quasi-free Hadamard state co of a conformally coupled scalar 
field y on a spacetime (M, g)> the background spacetime. Our aim is to calculate the 
perturbation of the background spacetime as specified by the correlation functions on 
the left-hand side of (9.2) due to the fluctuations of the stress-energy in the quantum 
state co as specified on the right-hand side of (9.2). In particular we will require that co 
satisfies (9.2a) when we identify the Einstein tensor of the background spacetime G ab 
with (G ab ) ( cf. Chap. 8 for a discussion of the solutions of the semiclassical Einstein 
equation in cosmological spacetimes). Note that by choosing this Ansatz we are 
completely ignoring any backreaction effects of the fluctuations to the background 
metric and evaluate the stress-energy tensor on a state specified on the background 
spacetime. 

Later on we consider perturbations of the scalar curvature induced by a ‘Newtoni- 
anly’ perturbed FLRW metric. For this reason it will be sufficient to work with the 
trace of (9.2) (using the background metric) instead of the full equations. With the 
definition 

s — -g G ab , 
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Figure 9.1. A few graphs illustrating (9.3c) for n = 2, n = 3 and n = 5. 

such that R= (S), the equations (9.2) simplify to 

m 2 1 

(S(x 1 )) = —5-[tv]- ? [vi] +ren. freedom, (9.3a) 

8u z 4 n z 

(SCx^SCxz)) - (S(x 1 ))(S(x 2 )) = m 4 (co 2 (x 1 ,x 2 ) + co 2 (x 2 ,x 1 )), (9.3b) 

and 

((S - (S))® n (x 1 ,...,x„)) = 2 n m 2n Sym [XiFl 2 In’ (9.3c) 

V r ij A ij■ J 

where the sum is over all directed graphs F with n vertices 1 with two arrows 
at every vertex directed to a vertex with a larger label. e [0,1,2} is the number of 
arrows from i to j. If we perform the symmetrization in (9.3c), we see that the sum 
is over all acyclical directed graphs with two arrows at every vertex. For illustration 
some graphs are shown in Fig. 9.1. 

To obtain (9.3b) and (9.3c), note that \(p 2 \ — co{\(p 2 \) does not depend on the 
choice of normal ordering 3 and thus only (9.3a) needs to be renormalized. Therefore 
we may choose normal ordering with respect to co 2 to see that the combinatorics 
are equivalent to those in Minkowski space. Moreover, as co 2 is a bisolution of the 
Klein-Gordon equation, the term |: (pV(p\ which causes the trace anomaly in (9.3a) 
(i cf. Sect. 7.2.4) does not contribute to the higher moments. 


9.2 Fluctuations around a de Sitter spacetime 

We shall now specialize the general discussion presented above to Newtonianly 
perturbed, exponentially expanding, flat FLRW universes. That is, the background 


3 Indeed this holds true if we replace t p 2 with Lip 2 , for any linear operator L. 
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spacetime (M, g) is given in conformal time t < 0 by the metric tensor 
g = (Ht) - 2 (—dT <8) dr + <5 i; - dx ! 8) dx ; ) 


and we consider fluctuations of the scalar curvature derived from metric perturbations 
of the form 

g = (Ht ) -2 ( — (1 + 2<P ) dx 8 ) dx + (1 — 2<P ) dx ! 8 > dx ; ). (9.4) 

The kind of fluctuations that we consider by choosing (9.4) resemble those that 
are present in single-scalar field inflation in the longitudinal gauge, where there 
are only ‘scalar fluctuations’ without anisotropic stress (so that the two Bardeen 
potentials coincide) [80, 162]. Notice that, for classical metric perturbation, these 
constraints descend from the linearized Einstein equation, however, a priori there 
is no similar constraint in (9.2b). Despite these facts, we proceed analyzing the 
influence of quantum matter on this special kind of metric perturbations and we 
also refrain from discussing the gauge problem associated to choosing a perturbed 
spactime; the chosen perturbation potential <P is not gauge invariant. 

We can now calculate the various perturbed curvature tensors and obtain in 
particular 

S = 12H 2 (1 — 3<F) + 24H 2 t 3 t <F — 6H 2 t 2 3 2 £ + 2H 2 t 2 V 2 ^ + 0(<P 2 ) 


for the trace of the perturbed Einstein tensor, where V 2 = < 9 2 , + < 3 2 2 + d\ is the 
ordinary Laplace operator. Dropping terms of higher than linear order, this can also 
be written as 

S-(S) = — 6H 2 t 4 (3 t - | V 2 ) t“ 2 4>, (9.5) 

where (S) = 12H 2 is nothing but the scalar curvature of the background spacetime. 
Notice that, up to a rescaling, the operator on the right-hand side of (9.5) looks like a 
wave operator with the characteristic velocity equal to l/\/3 of the velocity of light. 

We can now evaluate the influence of quantum matter fluctuations on the met¬ 
ric fluctuations by inverting the previous hyperbolic operator by means of its re¬ 
tarded fundamental solutions A ret and applying it on both sides of (9.3b) and (9.3c). 
From (9.3b) we can then (formally) obtain the two-point correlation functions of $ 
(per definition (£) = 0): 


($(x 1 )<P(ix 2 )) = m 4 


AretC*!, Ji)A ret (x 2 , y 2 ) O l&i, y 2 ) 


(9.6) 


+ ^2^2>yi)) d 4 Ji d 4 y 2 - 


Employing the retarded fundamental solutions in the inversion without adding any 
solution of (9.5), we are implicitly assuming that all the n-point distributions of the 
perturbation potential <P are sourced by quantum fluctuations. Here we are only 
interested in evaluating their effect. 


9.2.1 The squared two-point distribution 

In order to proceed with our analysis, we shall specify the quantum state to for the 
matter theory. Following the Ansatz discussed in the preceding section, we choose a 
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quasi-free Hadamard state which satisfies the semiclassical Einstein equation on the 
background. In particular, we require that co solves (9.3a), namely 


12 H 2 


m 


8tt^ 


M ~ + 4c 1 m 4 -c 2 m 2 R 

4tt z 


The right-hand side of the previous equation is characterized by three contributions: 
The state dependent part [w], the anomaly part [vx], which takes the simple form ( cf. 
Eq. (7.9)) 


K] = - 


H 4 

60 


m 


4 


8 ’ 


and the renormalization freedom c 2 nt 4 and c 2 m 2 R. Here we set c 2 = 0, because 
we assume the point of view that we have already measured Newton’s gravitational 
constant and do not wish to renormalize it ( cf. Sect. 7.2.3). That is, we have 


12 H 2 


m 


8tr 


[w] 


1 

8tt 2 




+ 4c 1 m 4 . 


For the semiclassical Einstein equation to hold, we therefore have to require that 
[w] is a constant. Then, having fixed H and m (no matter their absolute value), 
there is always a choice of c 1 for which the chosen metric g and co satisfy the 
semiclassical Einstein equation. On a de Sitter spacetime these criteria are satisfied 
by the Bunch-Davis state, cf. Sect. 6.2.1. 

In order to evaluate the influence of the quantum matter fluctuations on <P via 
equation (9.6), we have to discuss the form of the two-point distribution of the chosen 
state and its square. Any Hadamard state on (M,g) can be written is equal to the 
Bunch-Davies state up to smooth terms. In particular, the two-point distribution co 2 
of every Hadamard state on de Sitter spacetime is of the form 


co 2 (x,x') = lim 


H A 


TT 


o + 4n 2 (x — x'f 2 + 2ie(z — t') + e 2 


+ less singular terms, (9.7) 


where we write (x — x') 2 = — (t — t ') 2 + (x — x') 2 and, as always, the limit e —» 0 + 
is a weak limit. It is no surprise that the leading singularity is conformally related 
to the two-point distribution of a massless scalar field on Minkowski spacetime, we 
denote it by a> M . Thus it is also clear that the less singular contributions vanish in the 
limit of zero mass. 

As can be seen in (9.3b), we need to compute the square of the two-point 
distribution of the state in question. For our purposes it will be sufficient to compute 
the square of the leading singularity in the Hadamard state. 4 The square of the 
massless two-point distribution on Minkowski space is 


co m (x,x / ) 2 = lim 
£^ 0 + 


1 1 

4tt 2 (x — x') 2 + 2ie(r — t') + e 2 


2 


4 Note that, in (spatial) momentum space, the leading singularity in (9.7) contributes the smallest 
inverse power of the momentum fc; the ‘less singular terms’ correspond to higher inverse powers of k. 
Accordingly, these terms fall off faster for large k. This is nothing but the usual relation ship between 
high momenta and short distances. 
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Writing a> M in terms of its spatial Fourier transform, an expression for the spatial 
Fourier transform of the square of the massless Minkowski vacuum can be obtained 
as 


m M (x,x0 2 = lim 


iHx-x') e- ip ^-^e- £ P dpdk. 


*o + 128tt 5 

Later on we will use this expression in order to obtain the power spectrum of <P. 


(9.8) 


9.2.2 Power spectrum of the metric perturbations 

We want to compute the power spectrum P(t, k) of the two-point correlation of 4> at 
the time t. Since both the spacetime and the chosen state are invariant under spatial 
translation, it can be defined as 

<S(T, x) <Kt, * 0 ) = 7-^3 f P(T, k) dk. 

To obtain P, we first need an expression for the retarded operator A ret corresponding 
to (9.5): 


(A ret /)( T,x) = 
A ret (T,Ti,k) A 


rz 


(2tt) 3 

1 t 2 /3 


A ret (T, T i> k)f(T 1 , k)e^' x dT x d/c, 
■ sin {k (t - t^/a/3) , 


with 


where / is a compactly supported smooth function. We can then rewrite (9.6) in 
Fourier space to obtain 

r Z r Z 


P(t, k) = 2m 4 


A ret (T, t, k)A ret (T, t', k)m 2 (t, t', fc) dT dr'. 


•/ —oo •/ —oo 


Note that the symmetrization of the state is taken care of indirectly by the equal limits 
of the two integrations. 

As discussed above (see (9.7) and the following paragraph), we will compute the 
contribution due to the leading singularity of the Hadamard state: 


r T 


4^4 


P 0 ( T, k) = 2 H 4 m 




A ret (T, t, k) A ret (ir, t', k) t 2 t /2 o> 2 (t i, t', k) dr dx'. 


We emphasize at this point that, because of the form of (9.8) and of A ret , no infrared 
(with respect to k) singularity appears in P 0 (t, k) at finite t. Recall also that the error 
we are committing, using P 0 (t, k) at the place of P(t, k), tends to vanish in the limit 
of small masses. Inserting the spectrum of obtained in (9.8) and switching the 
order in which the integrals are taken (for e > 0), we can write 


P 0 (t, k) = lim 


m 


o + 16 tc 2 


k 4 


A(T,k/V3,p) 


-sp 


dp, 


(9.9) 


where we have introduced the auxiliary function 


A(t,k,p) A 


r z 2 

K T Z 


-00 T 1 


sin (jc (t — Ti)) e lpTl dz 1 , 


(9.10) 
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which can also be written in closed form in terms of the generalized exponential 
integral 5 E 2 as 

A(t,k,p)=A(kt,pt) = ~ kt ^E 2 (i(p + k) t) e 1KT -E 2 (i(p - k)t) e“ 1KT ) 

for p > k > 0 and by the complex conjugate of this expression if k > p, k > 0. In the 
following study of the form of the power spectrum P 0 the auxiliary function A will be 
instrumental. 


Lemma 9.1. For \p\ ^ k > 0, A(t, K,p ) has the t -uniform bound 


|A|< 


4 k 2 



For large negative times it satisfies the limit 


lim |A| = 

T—> —00 



(9.11) 


(9.12) 


Proof. Using the fact that 




3 -ipri _ 


dTi 2 


K 


- 1 


2 2 ’ 

— p z 


we can perform two integrations by parts to obtain 

k 2 

A(t,k,p)= —--(e“ ipT + R(t,k,p)), with 


K 2 —p 2 
r t 


R(t,k,p) = T 2 


-oo V T 1 


4 6 \ 

■ cos (k(t — Tj)) -I-7 sin (k (t — t x )) e lpTl dT 1 . 


kt: 


It is now easy to obtain an upper bound for R which is uniform in conformal time, 
namely \R\ < 3, which then yields the bound (9.11). 

For the second part of the proposition we perform a change of the integration 
variable to x = t 1 /t: 


f 00 


R(t,k,p) = 


4 6 

cos (kt (1 — x)) -I-7 sin (kt (1 — x)) ] e _lpTX dx. 


„ v-vu, v ,x “ ^ J J ' a 

KTX^ 


The contribution proportional to 1/t inR is bounded by C(k)/|t| and thus vanishes 
in the limit t —» —oo. Moreover, since |p| k and 1/x 3 is L 1 on [l,oo), we can apply 
the Riemann-Lebesgue lemma and see that this contribution vanishes in the limit 
t —* —oo. The remaining part of |A[ is k 2 |k 2 — p 2 | -1 , which is independent of t, and 
thus the limit (9.12) holds true. □ 


Note that the bound for A obtained above is not optimal. Numerical integration 
indicates that |A| 2 is monotonically decreasing in t and thus bounded by the limit 
stated in (9.12) (see also Fig. 9.2). Nevertheless, we can use this lemma to derive 
the following bounds and limits for P 0 : 


5 For a definition and various properties of these special functions see e.g. [166, Chap. 8]. 
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Proposition 9.2. The leading contribution P 0 to the power spectrum of the potential 
4> induced by a conformally coupled massive scalar field in the Bunch-Davis state is 
bounded by the Harrison-Zel’dovich spectrum uniformly in time, namely 


16C 3 — 2\/3 arccoth \/3 

C =-^ - m 

|fc| 3 192tt 2 

and it tends to the Harrison-Zel’dovich spectrum for t —* — oo, i.e., 

C 

lim P 0 (T,k) = -^~. 

T^-°° |k | 4 


P 0 ( T ^)| < 


Proof. The proof can be easily obtained using the t- uniform estimate (9.11) obtained 
in Lem. 9.1 and computing the integral 




< 


m 


r oo 


Tt^ 


3p 2 - k 2 


2 3 — 2 V3 arccoth \/3 m 4 

dp = — 


12tt 2 


k 3 


Having shown the first part of the proposition, let us now analyze the limit 


lim P 0 (t, fc) = 


m 


r oo 


16ti 2 


1 

— lim 

fc 4 T—> — 00 


a(t,/c//3,p) 


dp, 


where we have taken the T-limit before the integral and already evaluated the e-limit 


because \A\ 2 is bounded by an integrable function uniformly in time. Inserting the 
limit (9.12) from Lem. 9.1, we can compute the p-integral 


lim P 0 (T,k) 

T—» — 00 


167t 2 


f 00 

(3p 2 -k 2 r 2 dp = 

Jk 


3 — 2\/3 arccoth /3 m 4 
192tt 2 k 3 ’ 


thus concluding the proof. 


□ 


We can complement the results of Prop. 9.2 with the following observation: 


Proposition 9.3. The power spectrum P 0 has the form 




where SP 0 is a function of \k\r only. 


ffoGfclO 

|k | 3 


Proof. Noting that A(t, /c,p) is a function of k t and p t only and performing the 
e-limit in (9.9) inside the integral, this can be seen by the substitution x = p t 
in (9.9). □ 

We would like to improve the estimate of P 0 (t, k) for t close to zero. Adhering to 
our previous strategy, we shall first give a new estimate for A(t, k,p): 


Lemma 9.4. The auxiliary function A(t, K,p ) is bounded by 


\a(t,k,p)\ 


< 


2 

K T 


IP I 


p/0, T < 0. 
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—# 

Figure 9.2. Logarithmic plot of the rescaled power spectrum 9\)(|k|T), where C is 
the same proportionality constant as in Prop. 9.2. 


Proof. Recalling the form of A given in (9.10) and integrating by parts, where we use 
that e _ip ' ri = i p^ 1 d Zi e~ lpz \ we find 


iK 2 T 2 


f T f 1 

2 A 

— COS (k(t - Ti)) + 

sin [k (t t 2 )) 

J-00 W 

kt\ J 


A(t,k, p) = 


We then take the absolute value and estimate the trigonometric functions, which 
gives us a bound on A, namely 


|a(t,k,p)| < 


2 2 
K Z T Z 


\p\ 


Ak-'^r 


dTi = —2 


K 2 T 

w 


□ 


Performing the integration in p analogously to the second part of proposition 
(9.2), the last lemma immediately leads to a corresponding bound for P 0 : 


Proposition 9.5. The leading contribution P 0 of the power spectrum 
satisfies the inequality 


P 0 ^,k)\ < 


m 


36tt 2 |fc| 


of the potential 


and therefore, in particular, P 0 (0, k ) = 0. 


The rescaled power spectrum S^o(l^l T ) can be analyzed numerically and a plot 
is shown in Fig. 9.2. It clearly exhibits the asymptotic behaviour of P 0 discussed in 
Props. 9.2 and 9.5. Note that the horizontal axis is logarithmically scaled to highlight 
the behavior of S ? 0 for small \k\z, which would be concealed by the fast approach of 
2^ 0 to its bound had we used a linear scaling. 

In this section we have used the leading singularity 6 of the two-point function of 
the Bunch-Davis state on a de Sitter universe to compute the influence of quantum 

6 Recall that considering only the leading singularity in the Bunch-Davis state also corresponds to 
the limit of vanishing mass. 
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matter on the power spectrum of the metric perturbation <£>. We have seen that this 
results in an almost scale-invariant power spectrum. We stress that such a singularity 
is not a special feature of the Bunch-Davis state but is common for every Hadamard 
state. Moreover, although our analysis has been done on a de Sitter universe, similar 
quantum states have been constructed on universes which are asymptotically de 
Sitter spaces in the past [61, 62]. All these states tend to the Bunch-Davis state for 
t —* —oo and are of Hadamard form. 


9.2.3 Non-Gaussianities of the metric perturbations 

It follows from (9.3c) that the n-point correlation for 4> will, in general, not vanish. 
Also for odd n they will be different from zero and hence £ is not a Gaussian random 
field. As a first measure of the non-Gaussianity of ^ one usually calculates its three- 
point correlation function or the corresponding bispectrum B: 


(<?(t, X 3 ) <Kt, X 2 ) <Kt, x 3 )) 


(2tt) 5 


5(fci + k 2 + fc 3 )S( t, fc l3 k 2 , fc 3 ) 

i 

x dk 2 dk 3 . 


Assuming nonzero k 1} k 2 and k 3 , we will derive the form of the bispectrum B 
considering (as above) only the contribution due to the leading singularity of the 
Bunch-Davis state, which we will denote by B 0 . We will follow the same steps that 
lead us to the calculation of the power spectrum in the previous section. That is, we 
apply the retarded propagator A ret of (9.5) as in (9.6) to the right-hand side of (9.3c) 
for n = 3 to obtain an equation for ^ and insert for the two-point distribution the 
conformally rescaled two-point distribution of the massless Minkowski vacuum. The 
result can again be expressed in terms of the auxiliary function A defined in (9.10): 

Q -e (&>^(-fci)+co ? (fc 3 )+|p|) 

cop(- fc 1 )C0£(k 3 )|p| 

X A(t, k 1 , cop(-ki) + IpI)A(t, k- 3 , —cop(k 3 ) - |p|) (9.13) 

x A(t, k 2 , copik^ — cop^—k^) + permutations dp, 


B 0 (T,k 1 ,k 2 ,k 3 )= lim 


m 


o + 32-/3kjfc2^3 


where xq = |/ V3, cop(k ) = |fc + p| and the sum is over all permutations of 1,2,3. 

We can apply the same bound on A which has been used in the previous section 
to bound the power spectrum P 0 to produce a bound on the integrand of B () almost 
everywhere. 7 Nevertheless, the singularity in the integrand in (9.13) is integrable, 
i.e., B 0 is bounded. As a consequence we can perform the limit e —* 0 + inside the 
integral. 


7 We cannot bound the integrand of B 0 everywhere using (9.11) because \k 2 \/V3 ^ | o>j(fc 3 ) — 
ojj(-fcj)| (and permutations) does not hold everywhere. 
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Proposition 9.6. The leading contribution B 0 of the bispectrum of the metric perturba¬ 
tion has the form 


£o( T > fc t, fc 2,k 3 ) = 


28 0 (fciT,fc 2 T,fc 3 T) 


r -2 t .2 t .2 
K 1 K 2 ' C 3 


where 2& 0 is a function o/kjT, k 2 z, and k 3 z only and /q = |k ; | 


Proof. Analogously to Prop. 9.3, we note that after a change of variables x = z p the 
integrand in (9.13) is a function of fc 1 T, k 2 z, and k 3 z only. □ 


To finish our discussion about non-Gaussianities, we notice that, although the 
employed quantum field is a linear one, we obtained a three-point function for <P 
which is similar to the one obtained by Maldacena [150] who has quantized metric 
perturbations outside the linear approximation. 


9.3 Outlook 

In this chapter the influence of quantum matter fluctuations on metric perturbations 
over de Sitter backgrounds were analyzed. We used techniques proper of quantum 
field theory on curved spacetime to regularize the stress-energy tensor and to compute 
its fluctuations. In particular, we interpreted the perturbations of the curvature tensors 
as the realization of a stochastic field. We then obtained the n-point distributions of 
such a stochastic field as induced by the n-point distributions of a quantum stress 
tensor by means of semiclassical Einstein equations. 

We also noticed that, while the expectation value of the stress-energy tensor 
is characterized by renormalization ambiguities, this is no longer the case when 
fluctuations are considered. Hence the obtained results are independent on the 
particular regularization used to define the stress tensor. 

In order to keep superficial contact with literature on inflation, we investigated 
perturbations of the scalar curvature generated by a Newtonian metric perturbation, 
which is related to the standard Bardeen potentials. However, the considered model 
is certainly oversimplified to cover any real situation and is not gauge invariant. 

Within this model it was possible to recover an almost-Harrison-Zel’dovich power 
spectrum for the considered metric perturbation. Furthermore, the amplitude of such 
a power spectrum depends on the field mass which is a free parameter in our model 
and can be fixed independently of H. At the same time, since it does not depend on 
the Hubble parameter of the background metric, this indicates that it is not a special 
feature of de Sitter space. At least close to the initial singularity, the obtained result 
depends only on the form of the most singular part of the two-point function of the 
considered Bunch-Davis state. We thus argue that a similar feature is present in every 
Hadamard state and for backgrounds which are only asymptotically de Sitter in the 
past. 

Finally we notice that, since the stress-energy tensor is not linear in the field, 
its probability distribution cannot be of Gaussian nature. Thus we showed that 
non-Gaussianities arise naturally in this picture. 



Conclusions 


In this thesis aspects of the backreaction of quantum matter fields on the curvature 
of spacetime were discussed. The main results in this direction were discussed in 
Chaps. 8 and 9: the existence of local and global solutions of the semiclassical Einstein 
equation on cosmological spacetimes, and the a coupling the fluctuations of the 
quantum matter field to a Newtonianly perturbed de Sitter spacetime. Further results 
presented are the enumerative combinatorics of the run structure of permutations 
in Chap. 4 with applications to the moment problem of the Wick square and the 
stress-energy on Minkowski spacetime. 

In each case the problems were not treated in all possible generality, mainly 
due to the difficulty of constructing Hadamard states on general globally hyperbolic 
spacetimes but also due to other factors. Nevertheless, we studied the effects of 
quantum fields on cosmological spacetimes not only because of their relative simplicity 
but also because of the relevance in cosmology. Therefore the first avenue is not 
always the generalization of results to more general spacetimes, but also the better 
understanding of possible effects on this restricted class of spacetimes. For example, 
we already mentioned in Sect. 8.5 that the results are restricted to the conformally 
coupled scalar field with a certain choice for the renormalization freedom as other 
choices can lead to equations involving higher than second-order derivatives of 
the metric and ask for slightly different approach. However, it would be desirable 
to understand this problem also for non-conformal coupling and discuss the full 
dependence of the solutions to the Einstein equation on the renormalization freedom. 

In the case of results on the metric fluctuations induced by quantum matter 
fluctuations as presented in Chap. 9, we were even more restrictive and the discussion 
is mostly based on the special case of a Newtonianly perturbed de Sitter spacetime. 
While straightforward generalization of this idea to asymptotically de Sitter space- 
times are possible and were already published in [60], the next step should be to gain 
a clearer physical and mathematical motivation of the used equations. A development 
in this direction is [75], but also this work should only be seen as a first step. In any 
case, as soon as one attempts to take into account the fluctuations of the stress-energy 
tensor one is faced with the limitations of the semiclassical Einstein equation and 
any attempts to generalize them, even if well-motivated, remains speculative in the 
absence of an accepted theory of quantum gravity. 

On one hand, when attempting to study quantum field theory in a mathematically 
rigorous fashion one sees even clearer the non-uniqueness of many constructions and 
one is confronted with many choices: Are all Hadamard states physically sensible? 
What topology should be chosen for the algebra of quantum fields? What is the 
appropriate gauge freedom for the electromagnetic potential on non-contractible 
spacetimes? Is it reasonable to work with an algebra of unbouded operators such as 
the field algebra or should one always use a C*-algebra? There are many more ques¬ 
tions of this kind and they require further mathematical and physical investigations 
but also intuition. Quantum field theory and the quest for a quantum gravity is and 
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will continue to be not only a research effort and playground of physicists but also 
one of mathematicians. 

On the other hand, many aspects of quantum field theory are now conceptually 
and mathematically very well understood but only a few models have been studied 
in all their detail. In particular interacting quantum fields on curved spacetimes have 
received relatively little attention given that already free fields are a complicated 
matter. Investigations of physically interesting interacting models, using perturbative 
techniques, are largely absent from the literature and deserve more attention. 

For these reasons one can expect that the field of quantum field theory (on curved 
spacetimes) will remain an interesting field of study for many more years to come. 
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velocity of a 12 

cyclic *-representation. 60 

cyclic order. 84 


cyclic vector. 60 

D 

d’Alembert operator. 26 

de Rham cohomology. 24, 25 

~ with compact support . . . 24, 25 

de Sitter spacetime. 43 

chart of ~. 43 

conformal time of ~ . 44 

cosmological chart of ~. 44 

metric of 43 

world function of ~. 44 

decomposition of a tensor field . . 48 

5-distribution. 77 

dense set. 52 

derivative 

covariant ~ along a 

~ curve . 21 

~ vector field. 19 

directional 62 

exterior ~. 23 

Frechet 64 

Gateaux ~. see direct, deriv. 

Lie ~. 11, 14 

spatial covariant 38 

temporal covariant ~. 38 

descending permutation. 85 

descent set. 86 

diagonal distribution. 77 

diffeomorphism. 9 

differential form. 23 

closed 24 

coclosed 25 

coexact ~. 25 

exact ~. 24 

differential geometry. 7-33 

differential of a smooth map .... 11 

differential operator. 17 

formally adjoint ~ . 25, 79 

formally self-adjoint ~. 80 

Green-hyperbolic ~. 79 

normally hyperbolic ~. 18 

pre-normally hyperbolic 18 

principal symbol of a ~. 18 

total symbol of a 18 

Dirac 5-distribution. 77 

direct sum topology. 53 
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direction 

regular rsj . 73 

singular 74 

directional derivative. 62 

discrete topology. 52 

distribution. 69 

compactly supported ~ . 69 

kernel of a 71 

multiplication of . 76 

pullback of a ~ . 75 

restriction of a 69 

Schwartz ~ see tempered distribution 

smooth 73 

support of a 69 

tempered 69 

wavefront set in a cone . 74 

wavefront set of a . 74 

distributional section. 70 

compactly supported ~ . 70 

restriction of a 70 

wavefront set of a rsj . 76 

dual 

~ frame. 16 

~ metric. 15 

~ of a Banach space. 57 

~ of a connection. 20 

~ of a Frechet space. 57 

~ of a vector bundle. 15 

~ of the cotangent bundle .... 11 

~ of the cotangent space .... 11 

~ of the tangent bundle. 13 

~ of the tangent space. 13 

~ seminorm. 57 

strong ~. 57 

topological 56 

weak rsj . 57 

duality. 56 

distributions and test functions . 73 

dual vector bundle. 15 

musical isomorphisms. 15 

Poincare 26 

E 

Einstein curvature tensor. 42 

Einstein equation. 42 

semiclassical ~. 131-137 

vacuum solution of ~. 43 


element of a *-algebra 

adjoint rsj . 59 

normal 59 

self-adjoint 59 

unitary ~. 59 

embedding. 12 

Cauchy 106 

conformal 16 

hyperbolic 106 

isometric ~. 16 

~ of a submanifold. 12 

energy condition. 41 

dominated (DEC). 41 

null (NEC). 41 

strong (SEC). 42 

weak (WEC). 41 

energy conservation. 41 

energy density. 41 

energy-density 

~ per mode. 125 

quantum ~. 135 

enumerative combinatorics . . 83-100 

equicontinuous. 56 

Euclidean space. 16 

chart of 16 

metric of 16 

event. 37 

exact differential form. 24 

existence of a fixed-point. 66 

expansion scalar. 38 

expansion tensor. 38 

exponential map. 22 

exterior derivative. 23 

exterior tensor product. 15 

F 

falling atomic permutation. 85 

fibre of a vector bundle. 9 

field algebra 

off-shell ~. 109 

on-shell 110 

field equation. 109, 116 

Klein-Gordon ~. 112 

Proca ~. 113 

field operator. 62 

final topology. 53 

finer topology. 52 


















































































Index 


189 


first Bianchi identity. 21 

first Friedmann equation . . . 46, 134 

first-countable . 53 

fixed-point 

existence of a 66 

regularity of a rsj . 66 

uniqueness of a . 66 

fixed-point theorem. 65-68 

Banach ~. 65, 148 

Lipschitz continuity criterium 67-68, 
148 

flat connection. 21 

flat b. 15 

flow of a vector field. 13 

local ~. 13 

FLRW spacetime. 45 

conformal metric of a ~. 47 

conformal time of a ~. 47 

cosmological time of a ~ .... 46 

metric of a 47 

perturbed metric of a ~. 49 

world function of ~. 32 

fluid 

barotropic 41 

perfect 41 

foliation. 27 

leaves of a 27 

formally 

~ adjoint. 25, 79 

~ self-adjoint. 80 

Fourier transform . 71 

convolution theorem. 73 

inverse ~. 72 

Plancherel-Parseval identity . . 72 
Frchet space 

smooth sections. 69 

Frechet derivative. 64 

Frechet space. 55, 64 

dual of a 57 

rapidly decreasing functions . . 69 

smooth functions. 68 

frame 

comoving ~ . 37 

components in a ~. 17 

dual 16 

local rsj 16 


orthogonal rsj . 17 

unitary ~. 17 

Friedmann equation 

first ~. 46, 134 

second ~. 46, 134 

semiclassical ~ . . . . 134-137, 140 

Frobenius’ theorem. 26 

global ~. 27 

funct 

topologies. 55-56 

function 

~ on a manifold. 9 

rapidly decreasing ~. 69 

space of 55 

space of continuous ~. 53 

test ~. 69 

future 

causal 36 

chronological . 36 

G 

Gateaux derivative . . see direct, deriv. 
gauge invariant tensor field .... 47 
gauge problem in cosmology.... 47 

Gauss-Codacci equation. 39 

Gaussian state ... see quasi-free state 

general relativity. 40^14 

geodesic. 22 

geodesic distance. 27 

geodesically 

~ complete. 22 

~ convex. 22 

~ starshaped. 22 

geometry 

differential ~. 7-33 

Lorentzian 35-50 

global cotangent map. 13 

global tangent map . 11 

global trivialization. 9 

globally hyperbolic spacetime ... 40 

category of 106 

GNS construction. 60 

graph topology. 60 

Green’s operator 

advanced ~ . . . see advanced prop, 
retarded ~ .... see retarded prop. 
Green-hyperbolic. 79 
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H 

Hormander topology. 74 

Hadamard 

~ coefficients. 118, 134, 135 

~ form. 117 

~ parametrix. 118 

~ point-splitting regularization 133, 
136-137 

~ recursion relations. 118 

~ state. 117 

Hahn-Banach theorem. 55 

Harrison-Zel’dovich spectrum . . 154 

Hausdorff space . 52 

Hodge star. 25 

homeomorphism. 53 

homogeneous and isotropic 

~ spacetime . see FLRW spacetime 

~ state. 121 

Hubble 

conformal ~ function. 49 

~ constant. 43 

~ function. 46 

hyperbolic differential operator 

Green-~. 79 

normally 18 

pre-normally IN/ . 18 

hyperbolic-embedding . 106 

I 

immersion. 12 

conformal 16 

isometric ~. 16 

~ of a submanifold. 12 

index notation 

abstract /\/ 19 

concrete ~. 18 

index of a metric. 16 

inextendible curve. 12 

initial topology. 53 

injective tensor product. 58 

integrable 

Lebesgue 71 

square-~ . 71 

integrable plane field. 26 

integral 

~ curve. 12 

~ manifold. 26 


integration on manifolds. 24 

involution 

~ of a *-algebra. 59 

~ of a plane field. 26 

isometric 

~ embedding. 16 

~ immersion. 16 

isometry. 16, 53 

conformal 16 

K 

kernel. 71 

properly supported ~. 71 

regular 71 

semiregular rsj . 71 

kernel theorem 

abstract 58 

Schwartz ~. 70 

Killing vector field. 22 

conformal 22 

Klein-Gordon field. 112 

field equation . 112 

stress-energy tensor of the ~ . 132 

Koszul formula. 20 


L 

Lagrangian quantum field theory 104 


Laplace-de Rham operator. 26 

leaves of a foliation. 27 

Lebesgue integrable. 71 

Levi-Civita connection. 20 

Lie derivative. 11,14 

lightlike . 36 

linear permutation. 83 

~ group. 83 

run structure of . 93 

Lipschitz constant. 64 

local flow of a vector field. 13 

local frame . 16 

local section. 10 

local trivialization. 9 

locally compact topological space . 54 

locally convex space. 55, 64 

Banach space. 55 

Frechet space. 55, 64 

nuclear 58 

tensor product. 57-58 
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locally covariant QFT. 107 

locally Lipschitz. 64 

Lorentzian 

~ manifold. 16 

~ metric. 16 

Lorentzian geometry. 35-50 

Lorentzian manifold 

causal structure of a ~ ... . 36-37 

covariant splitting of a ~ . . 37-39 

L p space. 71 

M 

Mackey topology. 57 

manifold 

contractible 24 

integral 26 

Lorentzian 16 

pseudo-Riemannian ~. 15 

Riemannian 16 

smooth i-wi. 3 

strongly causal 40 

topological rsj . 3 

maximal atlas. 8 

maximal integrable curve. 13 

metric .... see also pseudometric, 53 

dual 15 

index of a 16 

Lorentzian 16 

~ of a FLRW spacetime. 47 

~ of de Sitter spacetime. 43 

~ of Euclidean space. 16 

~ of Minkowski spacetime ... 16 

~ on a vector bundle. 15 

Riemannian 16 

metric connection. 19 

metric space . . see also pseudometric 
space, 53 

microlocal analysis. 68-78 

microlocal spectrum condition . 117 

minimal curvature coupling ... 112 

minimal regular norm. 61 

Minkowski spacetime. 16, 43 

chart of 16 

metric of 16 

Minkowski vacuum state. 121 

mixed state. 60 

mode equation. 121 


momentum conservation. 41 

momentum density. 41 

multiplication of distributions ... 76 

musical isomorphisms. 15 

flat b. 15 

sharp jj. 15 

N 

n-point distribution. 115 

truncated rsj . 116 

neighbourhood. 52 

coordinate f^/ . 3 

net of local algebras. 105 

isotony. 105 

norm. see also seminorm, 55 

minimal regular . 61 

operator ~. 60 

normal element. 59 

normal topology. 74 

normally hyperbolic. 18 

normed space. 55 

nuclear space. 58 

quotient of a 58 

subspace of a ~. 58 

O 

off-shell field algebra. 109 

on-shell field algebra. 110 

one-form. see covector field 

one-line permutation notation ... 83 

open set. 52 

operator norm . 60 

orientable. 24 

orientation. 24 

negative ~. 24 

positive ~. 24 

time-~. 36 

orthogonal frame. 17 


P 

parallel 


~ propagator. 29 

~ transport. 21 

parallel propagator 

transport equation of the ~ . . . 29 

parametrized curve. 12 

past 
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causal 36 

chronological . 36 

Pauli-Jordan distribution. see 

commutator distribution 

peak of a permutation. 86 

perfect fluid. 41 

permutation 

ascending 85 

atom of a 85 

atomic ~. 85 

circular 84 

descending 85 

linear 83 

one-line notation. 83 

peak of a ~. 86 

principal atom of a ~. 85 

two-line notation. 83 

valley of a ~. 86, 95-97 

permutation group 

circular 84 

linear 83 

perturbation 

scalar 49 

tensor ~. 49 

vector ~. 49 

perturbed metric of a FLRW spacetime 
49 

physical spacetime. 47 

Plancherel-Parseval identity .... 72 

plane field. 26 

integrable 26 

involution of a 26 

Poincare duality. 26 

point-splitting regularization . . . 133, 
136-137 

pointwise convergence. 55 

pre-normally hyperbolic . 18 

pressure. 41 

quantum ~. 135 

pressure-free matter. 41 

principal atom of a permutation . 85 

principal symbol. 18 

principle 

~ of causality. 105, 107 

~ of covariance. 106 

~ of locality. 105,106 


timeslice axiom. 107 

Proca field. 113 

field equation . 113 

product topology. 53 

projected symmetric trace-free part 38 

projective tensor product. 57 

propagation of singularities .... 78 

propagator 

advanced 79 

causal 80 

parallel ~. 29 

retarded 79 

proper map. 54 

proper time. 36 

properly supported kernel. 71 

pseudo-Riemannian manifold ... 15 

conformal symmetry of a ~ . . . 23 

symmetry of a 23 


pseudometric ... see also metric, 53 
pseudometric space . . see also metric 


space, 53 

bounded rsj . 54 

complete 54 

completion of a ~. 54 

pullback 

~ of a connection. 20 

~ of a covector field. 13 

~ of a distribution . 75 

~ of a section. 10 

~ of a vector bundle. 10 

~ of a vector field. 12 

pure state. 60 

pushforward 

~ of a covector field. 13 

~ of a section . 10 

~ of a vector field. 12 

Q 

quantum energy-density. 135 

quantum field. 109,111 

quantum field theory 

algebraic rsj . 105 

Lagrangian ~. 104 

locally covariant ~. 107 

quantum pressure. 135 

quantum state. see state 

quantum stress-energy tensor . . 132 
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quasi-free state. 116 

quotient bundle. 10 

quotient topology. 53 

R 

rapidly decreasing function .... 69 
Raychaudhuri 

~ equation. 43 

~ scalar. 38 

regular ^-representation . 62 

regular direction. 73 

regular kernel. 71 

regularity of a fixed-point. 66 

regularization 

adiabatic ~ . . . 136-137, 142-146 
Hadamard point-splitting <~s^ 133, 

136-137 

relative Cauchy evolution. 107 

renormalization of the stress-energy ten¬ 
sor . 132 

rest space . 37 

restriction of a 

~ distribution. 69 

~ distributional section. 70 

retarded propagator. 79 

Ricci 

~ curvature scalar. 21 

~ curvature tensor. 21 

Riemann curvature tensor. 21 

Riemannian 

~ manifold. 16 

~ metric. 16 

rising atomic permutation. 85 

run. 86 

run structure. 86 

~ of atomic permutation . . . 87-91 
~ of circular permutation . . 91-93 
~ of linear permutation. 93 


S 

scalar field ... see Klein-Gordon field 

scalar perturbation. 49 

Schwartz distribution . see tempered 
distribution 

Schwartz kernel theorem. 70 

secant numbers. 91 

second Bianchi identity. 21 


second Friedmann equation . 46, 134 

second-countable. 52 

section. 10 

components of a ~. 17 

local 10 

pullback of a 10 

pushforward of a ~. 10 

test ~. 70 

self-adjoint element. 59 

semiclassical Einstein equation 131-137 

solution of the ~ . 139-151 

~ global. 148-150 

~ local. 146-148 

~ maximal . 149 

~ numerical. 150 

~ regular. 148 

semiclassical Friedmann equations 134- 
137, 140 

seminorm. see also norm, 54 

dual ~. 57 

family of 55 

separating ~. 55 

semiregular kernel. 71 

separating seminorm. 55 

set 

closed 52 

closure of a 52 

connected 52 

dense 52 

open ~ . 52 

shadow, causal 36 

sharp D. 15 

shear tensor. 38 

cr-compact topological space .... 54 

simple *-algebra. 62 

singular direction . 74 

set of ~. 74 

set of localized 74 

singular support. 73 

smooth 

~ distribution. 73 

smooth atlas. 8 

smooth functions. 68 

smooth manifold. 8 

smooth sections. 69 

smoothly compatible charts. 8 
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space 

Euclidean 16 

Hausdorff ~. 52 

locally convex ~. 55, 64 

nuclear 58 

~ of compactly supported distribu¬ 
tional sections. 70 

~ of compactly supported distribu¬ 
tions . 69 

~ of continuous functions .... 53 
~ of distributional sections ... 70 

~ of distributions. 69 

~ of functions. 55 

topologies. 55-56 

~ of L p functions. 71 

~ of rapidly decreasing functions 69 

~ of smooth functions. 68 

~ of smooth sections. 69 

~ of tempered distributions ... 69 

~ of test functions . 69 

~ of test sections. 70 

total ~. 9 

spacelike. 36 

spacetime. 39 

anti-de Sitter ~. 43 

background rsj . 47, 155 

cosmological ~ see FLRW spacetime 

de Sitter ~. 43 

~ FLRW. 45 

globally hyperbolic ~. 40 

homogeneous and isotropic ~ . see 
FLRW spacetime 

Minkowski ~. 16, 43 

physical 47 

spatial covariant derivative. 38 

square-integrable. 71 

state. 60,115 

adiabatic ~. 119,122-127 

Bunch-Davies . 119 

Gaussian ~ . . . see quasi-free state 

Hadamard 117 

homogeneous and isotropic ~ 121 

Minkowski vacuum ~ . 121 

mixed 60 

~ of low energy. 125-127 

pure ~. 60 


quasi-free 116 

strongly regular . 62 

Stokes’ theorem. 25 

stress-energy tensor . . . 40,131-134 
~ of the Klein-Gordon field . . 132 

quantum ~. 132 

renormalization of the ~ ... 132 

trace of ~. 41, 134, 140 

strong 

~ dual. 57 

~ topology. 57 

strongly causal 

~ at a point. 40 

~ manifold. 40 

strongly cyclic ^representation . . 60 

strongly regular state. 62 

submanifold 

embedding of a ~. 12 

immersion of a ~. 12 

tangent space of a ~. 12 

subspace topology. 53 

summable sequence. 58 

absolutely 58 

summation convention. 17 

support 

~ of a distribution . 69 

singular 73 

symmetric tensor product. 14 

symmetry of a manifold. 23 

~ conformal. 23 

Synge bracket. 27 

Synge’s rule. 27 

Synge’s world function. 28 

transport equation of ~. 28 

T 

tangent bundle. 11 

dual of the ~. 13 

tangent map. 11 

global 11 

tangent numbers. 93 

tangent space. 11 

dual of the ~. 13 

~ of a submanifold. 12 

tempered distribution. 69 

temporal covariant derivative ... 38 
tensor field . 14 
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components of a rsj . 17 

contravariant ~. 14 

covariant ~. 14 

decomposition of a ~. 48 

gauge invariant ~. 47 

projected symmetric trace-free part of 

a ~. 38 

tensor perturbation. 49 

tensor product. 14 

antisymmetric ~. 14 

exterior ~. 15 

injective ~ topology. 58 

projective ~ topology. 57 

symmetric ~. 14 

test function. 69 

~ topology. 69 

test section . 70 

time 

conformal 47 

cosmological 46 

~ function. 36 

-orientation. 36 

proper ~ . 36 

timelike . 36 

timeslice axiom. 107 

topological 

>|c_ algebra. see *-algebra 

~ basis . 52 

~ dual. 56 

~ local 

~ topological local basis .... 52 

manifold. 8 

topological space. 52 

compact ~. 54 

locally compact . 54 

cr-compact ~. 54 

topological vector space . 54 

convex ~. 54 

locally convex ~. 55, 64 

topology. 52-58 

coarser ~. 52 

compact-open ~. 56 

direct sum 53 

discrete 52 

final ~. 53 

finer 52 


graph rsj 60 

Hormander 74 

initial 53 

injective tensor product ~ . . . . 58 

Mackey ~. 57 

normal 74 

~ of pointwise convergence ... 55 

~ of test functions . 69 

~ of uniform convergence .... 56 

~ in comp acta. 56 

product 53 

projective tensor product 57 

quotient ~. 53 

strong ~. 57 

subspace 53 

trivial 52 

uniform operator ~. 60 

weak 57 

torsion-free connection. 19 

total space. 9 

total symbol. 18 

trace anomaly. 134 

transition map . 8 

transport 

~ operator. 30 

parallel 21 

transport equation. 30 

~ of Synge’s world function ... 28 

~ of the parallel propagator . . 29 

~ of the van Vleck-Morette det 29 

trivial topology. 52 

trivialization 

global ~. 9 

local ~. 9 

truncated n-point distribution . . 116 

two-line permutation notation ... 83 

U 

uniform convergence. 56 

~ in compacta. 56 

uniform operator topology. 60 

uniformly continuous. 54 

uniqueness of a fixed-point. 66 

unital *-algebra. 59 

unitary element. 59 

unitary frame. 17 
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V 

valley of a permutation . . 86, 95-97 

van Vleck-Morette determinant . . 28 

transport equation of the ~ . . . 29 

vector. 11 

coordinate IN/ . 17 

cyclic 60 

vector bundle. 9 

antisymm. tensor product of a ~ 14 

dual of a 15 

exterior tensor product of a /N-l 15 

fibre of a in/. ^ 

homomorphism. 9 

metric on a ~ . 15 

~ projection. 9 

pullback of a ^Ni . 10 

quotient of a N/ . 10 

subbundle of a 10 

symmetric tensor product of a ~ 14 

tensor product of a ~. 14 

vector field. 11 

complete N/ . 13 

components of a ~. 17 

conformal Killing ~. 22 

flow of a 13 

Killing ~ . 22 

local flow of a rsj . 13 

pullback of a 12 

pushforward of a ~. 12 

vector perturbation . 49 


vector subbundle. 10 

velocity of a curve. 12 

volume form. 24 

~ induced by a metric. 24 

vorticity tensor. 38 


W 

wave 


~ equation. 78 

~ operator. 18 

wavefront set 

~ of a distribution . 74 

~ of a distributional section ... 76 
weak 

~ dual. 57 

~ topology. 57 

Weyl 

~ *-algebra. 61 

~ C*-algebra. 61 

~ generator . 61 

world function. 28 

~ of de Sitter spacetime. 44 

~ of FLRW spacetime. 32 

world line. 36 

acceleration . 38 

expansion scalar. 38 

expansion tensor . 38 

rest space. 37 

shear tensor . 38 

vorticity tensor. 38 
















































